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SYNTHETIC TAG GENES 
This application claims the benefit of U.S. provisional application 60/395,530, 
5 iBled July 12, 2002, the disclosures of which are incorporated here by reference in their 
entirety for all purposes. 

FIELD OF INVENTION 
10 This invention relates in general to methods for nucleic acid analysis, and, in 

particular to, synthetic Tag genes useful as assay controls, in assay development, product 
development and validation, and for quality cpntrpl. 

BACKGROUND OF THE INVENTION 
1 5 New teclmology has enabled the production of microarrays smaller than a 

thunibnail that contain hundreds of thousands or more of different molecular probes. 
These techniques are described in U.S. Pat No. 5,143,854, PCT WO 92/10092, and PCT 
WO 90/15070. Microarrays have probes arranged in arrays, each probe ensemble 
assigned a specific location. Microarrays have been produced in which each location has 
20 a scale of, for example, ten microns. The microarrays can be used to determine whether 
target molecules interact with any of the probes on the microarrays. After exposing the 
array to target molecules under selected test conditions, scanning devices can examine 
each location in the array and determine whether a target molecule has interacted with the 
probe at that location. 

25 Microarrays wherein the probes are oligonucleotides ("oligonucleotide arrays") 

show particular promise. Arrays of nucleic acid probes can be used to extract sequence 
information from nucleic acid samples. The samples are exposed to the probes under 
conditions that allow hybridization. The arrays are then scanned to determine to which 
probes the sample molecules have hybridized. One can obtain sequence information by 

30 selective tiling of the probes with p^icular sequences on the arrays, and using 

algorithni$ to compare patterns of hybridization and non-hybridization. This method is 
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useful for sequencing nucleic acids. It is also useful in gene expression monitoring, i.e., 
monitoring the expression of a multiplicity of preselected genes. 

There is a need for exogenous nucleic acid controls ("spikes") for microarray 
analysis. While genotyping applications will benefit from the use of spikes, the need is 
5 especially acute for gene expression monitoring, in which the goal is to determine the 
quantity of each transcript species in a sample. Variations in sample preparation, 
hybridization conditions, and array quality are just some of the factors that influence the 
values determined for the transcript levels of different samples. Constructing large 
databases of samples prepared differently and hybridized to different array types becomes 
1 0 especially challenging. The use of quality-assured control polynucleotides during sample 
preparation and during hybridization to microarrays greatly enhances the ability to 
normalize data and to compare experiments, as well as to monitor each step of the assay. 
Many other applications can also benefit from control spikes. One advantage comes from 
starting with defined quantities of spiked polynucleotides of known sequences. 

15 

SUMMARY OF THE INVENTION 
In one aspect of the invention, a method to construct a synthetic "gene" composed 
of linked synthetic Tag gene sequences is provided. In one embodiment, the genes, about 
500 to 4000 base pairs long, are made by annealing and extending overlapping 60mer 

20 oUgonucleotides followed by cloning into a plasmid vector. Both poly(A)-tailed sense 
(Tag) RNA and antisense (Tag Probe) RNA can be produced from the clones by in- vitro 
transcription. In another embodiment, the genes can be used as exogenous spikes for any 
sample. In another aspect of the invention, these synthetic gene spikes can serve as 
normalization controls in gene expression monitoring experiments and can also be used 

25 to assess system specificity, sensitivity, and dynamic range. These synthetic Tag genes 
are thus useful in assay development, in product development and validation, and for 
quality control. 



30 



3 



PATENT 
Attorney Docket No.:3502.1 

BRIEF DESCRIPTION OF THE DRAWINGS 
The accompanying drawings, which are incorporated in and form a part of this 
specification, illustrate embodiments of the invention and, together with the description, 
serve to explain the principles of the invention: 
5 Figure 1 . Synthesizing genes from oligonucleotides. A) Each 60-mer oligonucleotide is 
designed to overlap by 20 bases two different oligonucleotides encoding the opposite 
strand. In this case the left-most antisense oligonucleotide circularizes the assembly by 
annealing to the 5' end of the leftmost sense oligonucleotide and to the 3' end of the 
rightmost sense oligonucleotide. B) Extension of the annealed oligonucleotides by DNA 

10 polymerase results in a spiral concatamer. C) Multiple rounds of extension, with 

replenishment of nucleotides and polymerase each round, can yield products over 50 kb 
in length (the largest marker band is 12 kb). Assembly of five different genes is shown 
here. D) PCR or restriction eiidonuclease digestion of a concatamer can yield a single 
monomer, which cian then be cloned into a vector. 

15 Figure 2. Tag clone arrangement in a plasmid vector. Each Tag gene consists of linked 
GenFlex™ (Affymetrix, Inc., Santa Clara, CA) Tag sequences, arranged so that 
. transcription from the T3 promoter makes poly(A)-tailed sense (Tag) RNA, and T7 
transcription makes antisense (Tag probe) RNA, 
Figure 3. BigTag clone arrangement in a plasmid vector. 

20 Figure 4, Using Tagl-Q plasmid a control for long-range PCR. The PstI -linearized 
plasmid is depicted in panel A. Three primer-binding sites and two PCR amplicons are 
indicated. Panel B gives the sequences of the priniers that are used to produce the PCR 
products shown in panel C (the two PCRs were performed in triplicate). Plasmid Tagl-Q 
and the primers can be used as quality-assured reagents to control for the long-range 

25 PCRs, fragmentation, labeling, and/or hybridization steps in genotyping assays. 

Figure 5. Site-directed mutagenesis added restriction endonculease recognition sites for 
Xbal ("X") and for EcoRI ("E") to pTaglQ to create plasmid pTaglQ.EX (panel A). 
Panel B is an agarose gel demonstrating the presence the expected products following 
Xbal/EcoRI double digests. 
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DETAILED DESCRIPTION OF THE INVENTION 
The present invention has many preferred embodiments and relies on many 
patents, applications and other references for details known to those of the art. Therefore, 
when a patent, application, or other reference is cited or repeated below, it should be 
5 understood that it is incorporated by reference in its entirety for all purposes as well as for 
the proposition that is recited. 

As used in this application, the singular form "a," "an," and "the" include plural 
references unless the context clearly dictates otherwise. For example, the term "an agent" 
includes a plurality of agents, including mixtures thereof. 
10 An individual is not limited to a human being but may also be other organisms 

including but not limited to mammals, plants, bacteria, or cells derived from any of the 
above. 

Throughout this disclosure, various aspects of this invention can be presented in a 
range forniat. It should be understood that the description in range format is merely for 

1 5 convenience and brevity and should not be construed as an inflexible limitation on the 
scope of the invention. Accordingly, the description of a range should be considered to 
have specifically disclosed all the possible subranges as well as individual numerical 
values within that range. For example, description of a range such as from 1 to 6 should 
be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, 

20 from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers 

within that range, for example, 1, 2, 3, 4, 5, arid 6. This applies regardless of the breadth 
of the range. 

The practice of the present invention may employ, unless otherwise indicated, 
conventional techniques and descriptions of organic chemistry, polymer technology, 

25 molecular biology (including recombinant techniques), cell biology, biochemistry, and 
immunology, which are within the skill of the art. Such conventional techniques include 
polymer array synthesis, hybridization, hgation, and detection of hybridization using a 
label. Specific illustrations of suitable techniques can be had by reference to the example 
hereinbelow. However, other equivalent conventional procedures can, of course, also be 

30 used. Such conventional techniques and descriptions can be found in standard laboratory 
manuals such as Genome Analysis: A Laboratory Manual iSeries (Vols. I-IV), Using 
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Antibodies: A Laboratory Manual, Cells: A Laboratory Manual, PGR Primer: A 
Laboratory Manual, and Molecular Cloning: A Laboratory Manual (all from Cold Spring 
Harbor Laboratory Press), Stryer, Biochemistry, (WH Freeman), Gait, "Oligonucleotide 
Synthesis: A Practical Approach" 1984, IRL Press, London, all of which are herein 
5 incorporated in their entirety by reference for all purposes. 

The present invention can employ solid substrates, including arrays in some 
preferred enibodiments. Methods and techniques applicable to polymer (including 
protein) array synthesis have been described in U,S:S.N 09/536,841, WO 00/58516, U.S. 
Patents Nos. 5,143,854, 5,242,974, 5,252,743, 5,324,633, 5,384,261, 5,424,186, 

10 5,451,683, 5,482,867, 5,491,074, 5,527,681, 5,550,215, 5,571,639, 5,578,832, 5,593,839, 
5,599,695, 5,624,711, 5,631,734, 5,795,716, 5,831,070, 5,837,832, 5,856,101, 5,858,659, 
5,936,324, 5,968,740, 5,974,164, 5,981,185, 5,981,956, 6,025,601, 6,033,860, 6,040,193, 
6,090,555, and 6,136,269, in PCT Applications Nps. PCT/US99/00730 (International 
Publication Number WO 99/36760) and PCT/US 01/04285, and in U.S. Patent 

15 Applications Serial Nos. 09/501,099 and 09/122,216 which are all incorporated herein by 
reference in their entirety for all purposes. 

Patents that describe synthesis techniques in specific embodiments include U.S. 
Patents Nos. 5,412,087, 6,147,205, 6,262,216, 6,310,189, 5,889,165, and 5,959,098. 
Nucleic acid arrays are described in many of the above patents, but the same techniques 

20 are applied to polypeptide arrays, 

The present invention also contemplates niany uses for polymers attached to solid 
substrates. These uses include gene expression monitoring, profiling, library screening, 
genotyping, and diagnostics. Gene expression monitoring, and profiling methods can be 
shown in U.S. Patents Nos. 5,800,992, 6,013,449, 6,020,135, 6,033,860, 6,040,08, 

25 6,177,248 and 6,309,822. Genotyping and uses therefor are shown in USSN 10/013,598, 
and U.S. Patents Nos. 5,856,092, 6,300,063, 5,858,659, 6,284,460 and 6,333,179. Other 
uses are embodied in U.S. Patents Nos, 5,871,928, 5,902,723, 6,045,996, 5,541,061, and 
6,197,506. 

The present invention also contemplates sample preparation methods in certain 
30 preferred embodiments. For example, see the patents in the gene expression, profiling, 
genotyping and other use patents above, as well as USSN 09/854,317, Wu and Wallace, 
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Genomics 4, 560 (1989), Landegren et al, Science 241, 1077 (1988), Burg, U.S. Patent 
Nos. 5,437,990, 5,215,899, 5,466,586, 4,357,421, Gubler et aL, 1985, Biochemica et 
Biophysica Acta, Displacement Synthesis of Globin Complementary DNA: Evidence for 
Sequence Amplification, transcription amplification, Kwoh et aL, Proc, Natl. Acad. Sci. 
5 USA 86, 1 173 (1989), Guatelli et al., Proc. Nat. Acad. Sci. USA, 87, 1874 (1990), WO 
88/10315, WO 90/06995, and 6,361,947. 

The present invention also contemplates detection of hybridization between 
ligands in certain preferred embodiments. See U.S. Pat. Nos. 5,143,854, 5,578,832; 
5,631,734; 5,834,758; 5,936,324; 5,981,956; 6,025,601; 6,141,096; 6,185,030; 6,201,639; 
10 6,218,803; and 6,225,625 and in PCT Application PCT/US99/ 06097 (published as 

W099/47964), each of which also is hereby incorporated by reference in its entirety for 
all purposes. 

The present invention may also make use of various computer program products 
and software for a variety of purposes, such as probe design, management of data, 
15 analysis, and instrument operation. See, U.S. Pat. Nos. 5,593,839, 5,795,716, 5,733,729, 
5,974,164, 6,066,454, 6,090,555, 6,185,561, 6,188,783, 6,223,127, 6,229,91 1 and 
6,308,170. 

Additionally, the present invention may have preferred embodiments that include 
methods for providing genetic information over the internet. See provisional application 
20 60/349,546. 

1. Synthetic Tag genes 

In accordance with one aspect of the present invention, synthetic genes are made 
using Affymetrix GenFlex'^^ (Affymetrix, Inc., Santa Clara, CA) Tag sequences. Tag 

25 sequences are 20mer probes which were selected from all possible 20mers to have similar 
hybridization characteristics and minimal homology to sequences in the public databases. 
See, e.g., U.S. Patent No. 6,458,530 (incorporated here by reference). The list of the 
reverse complements corresponding to the Tag sequences (also sometimes called the Tag 
probes) used to construct the Tag genes is set forth below in Seq. Id. Nos. 1-2050 

30 ■ 



Seq. Id 


3' to 5' sequence 


1 


TAAACTAGCATTGAGCCCAC 
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Seq. Id 


3' to 5' sequence 


2 


AAATCAGCAAACGGGCTCCG 


3 


GAATTGATAATCGCAGCCAC 


4 


GATATAGGAATGGCGCATAC 


5 


CTCATCGGAAGGGCTCGTAA 


6 


ACAGATGGAAAGGCAGTTCT 


7 


TTTGGTAGCTGAGTGCCCTA 


8 


TAACTGGTTTGAGGCCACGC 


9 


TAATTiSAGCTGAGGGCGCAC 


10 


TTGTTGCTACTCTGGCCCGA 


11 


TTCCGTGCATAGTATAGGGA 


12 


TTATGGGACTTATCTCGGGA 


13 


TGTATAGGATTATGTCCGCG 


14 


CTGCTAGGAATATGAGCTAG 


15 


CTTCTtSTCAATAtGGGTACG 


16 


[TATTTCGAGATATGAGGCGC 


17 


TTGATCGTAGATTCGTGAGC 


18 


CGAGATTACAATTCACGAGC 


19 


TGGTGTCTAGCTTCCAGCCT 


20 


TGAGGTCACGGTTCATGCTA 


21 


TGGTTACTGGTATATGCCGC 


22 


CCGAGTGCAGAATAAACCCG 


23 


GCGGTCTCAATACAAACTCA 


24 


GAAGCTACCATACGCGAGCA 


25 


ACGGGATAACAACGCAGCCT 


26 


AGAAGATCAACAGCTCGTCC 


27 


ATAAGATCAAGACCTGTGCC 


28 


ATTAGATTAAGACCAGCGCC 


29 


ATATAATCAAGACTGGCGCG 


30 


AGGATATAACCACtGATCCG 


31 


ACACTATTAAAGCTGCTCCG 


32 


CAATGTATAAGACTCTCGCC 


33 


CACTAATTCAGACGAAGCCG 


34 


GACGCTATCAGACAGATGCA 


35 


CACGCATCAAGACAGTATCG 


36 


CAGCTCCTAAGACTTGGACA 


37 


GGTATCATAGGACATTCGCA 


38 


GGTTACATGGATATAGCACC 


39 


TGTGTTTCAGCTATGCAGGC 


40 


TAATTCGCTGCAACCAGATC 


41 


ATAATTCCAACATGGGAGCC 


42 


CATTdCTTAATATGGGAGCC 


43 


CAATGCTTAATACCGACACG 


44 


GATTGCTTAGACCCTGCACG 


45 


GATTCATTAGACCAGGCGCT 


46 


GATTCTACATGCCACTAGCA 


47 


CCTGCGAACTGGCCTGAATA 


48 


CGCAGCGGAAGGCTCAATAA 


49 


CCTACCGCAAGGCAGGATAA 
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Seq. Id 


3' to 5' sequence 


50 


CCTATGATAAGGCACGCACA 


51 


CGGTGTGGAAGGCTCGTATA 


52 


CGATTGTCAAGGCAGTGATA 


53 


CATTGCGAACTGCATCTAAC 


54 


GATAGTCCAATGGTACTGAC 


55 


GATTCGGTAATGCGCTGTAA 


56 


GACGTTTCAATGCAGCGTAA 


57 


GAGAGTGCAATGCCGACTAA 


58 


GAGATCCGAATGGGCGTACT 


59 


CGAGATCCAAGGGGCATGAT 


60 


AGGTTGCACAGTAACGATGA 


61 


AGAGTTGAACAGCATACCCT 


62 


TATCTGATCGGAGGGGCAGT 


63 


TATTGACTAGTGCGCGTCAG 


64 


TTGGACTATTGGGTATCGCG 


65 


TTGTCAGATTGGATGGGCTC 


66 


TATGCAGAATGGCGTGTATG 


67 


CATTGGATAAGCACTGATGG 


68 


CCCGGAATAAGGCCACGATA 


69 


CTCATAGAATGGACCAGATC 


70 


CATAGATTAAGCACTGAGCC 


71 


CATGATGTAAGCACGGTACC 


72 


CAGGAGCC5AAGCAGATACTC 


73 


GAGAGCAGAAGCACTCACGT 


74 


TACATAGGCTTCAGCATCAC 


75 


TATTATACCTTGATCCGCGC 


76 


TAAACTGCTTGCATACGGCG 


77 


TATAAGCCTTGCAGCGGACC 


78 


TTTAAGCGGTGGATCTAGGT 


79 


TTAATAGCCTTGAGCAGCGA 


80 


ATAAATGCTtGGAACCCTCG 


81 


GAAAGTTCATGGAATCGAGC 


82 


GCAAGGATTTCGACTCAGAC 


83 


CAAAGAATAATCGCTCCTCG 


84 


TAAAGCACTTATGACTCGGC 


85 


TTATAGCATTCTGTAGGCGC 


86 


TGGCTGACATTTGATTAGCC 


8? 


CCTTGAATAATATCTCGGCC 


88 


AGGTCCAGAAATTGCTGCAC 


89 


AGCTCAGGAAATTCTAGGGA 


90 


AGCTATGCAAATTAGAGGCC 


91 


GGTAGGCTAATrTATGGCAC 


92 


CTAATGGAATTCAATGCCGG 


93 


CAACTGGCAATCAATACGGT 


94 


CCAAGCGAATGCAACGTATG 


95 


GCATAGCGAATTG6AGATAC 


96 


GCATGTCGAATGGATGATAC 


97 


GCACGTTCAATGGCTCGACT 
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Seq. Id 


3' to 5' sequence 


98 


GCAGCGCAATCTGTCGAGTA 


99 


AGCAGTGCAAATCCTGATAC 


100 


AGCTTCGCAAATCTC3GTACA 


101 


AGCCTGCGAAATCTACTGAA 


102 


GCAGATCGAATTATGGAGAG 


103 


GCAGAGTCAATTATCATGCC 


104 


CGTrAGGCAATACATTTCCG 


105 


ACTGGTGCAAAGTCTTCGAC 


106 


GGTATATGAATGTGTCGTCC 


107 


GATAGTGCAATCTAGGTGAC 


108 


GCAGTGCAATGGATGTACTA 


109 


GCTAGGCTAATGTCCGGCTA 


110 


GGTAGCCTAATGTGTGCTCA 


111 


GGAGGTGCAATCTtGTGACC 


112 


gagcgccgaatctagtggaa 


113 


GGGAGCGACCTCTAGCTTAT 


114 


GCGGGTCGAATCTCGCTTAA 


115 


CGCCGCGCAAGCTGTATTAA 


116 


CGGCTGCGAAGCTGTGTTAA 


117- 


CATCCGCTAAGATCGGTTAA 


118 


CGTGCAGCATAATGCATCAG 


119 


TGAGAGGTGGATCGCATTCC 


120 


TAGGTGCTAGGATCTCAGCC 


121 


TAGGTATCAGGATTCAGGCC 


122 


TGCGCCAGTGAGTGGTATAT 


123 


CAGCAACGTGGATCAACTAT 


124 


CAGCGGCTAAGATCAATACC 


125 


GCAGCCTAATCTGGCCTAGT 


126 


GGGCCTGTACCTGCAATTCA 


127 


TAGGGCGGACCTGCTGTTAT 


128 


TAAGCCGCCACGGAGTGTTA 


129 


TAAGGCTCTTGAGACGTAGT 


130 


TAAGCCCGATCAGCATGGAC 


131 


TTGCCCGTAGTCAGCTTAGA 


132 


GAAGGAGCGATCAGACACTG 


133 


CAGGCACCAAGTAGCAGAGT 


134 


GGTGCGCCATGTACTCAGTT 


135 


TCAGGCTTATCGAGCGCGTT 


136 


GCAGGCAGATCGACCTAGTT 


137 


ggatagggagtcagatatac 


138 


gcatggttacctacggcaga 


139 


ggaggctgagtcatacgcaa 


140 


ggagcctgacgtagtcgata 


141 


gcggccaattcggggataat 


142 


ggtgctcgacattaggcgat 


143 


gatcccacatagcggacaat 


144 


gatccaatctgtcagcacat 


145 


gagccaatctgactaccagt 
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Seq. Id 


3' to 5* sequence 


A AC 

146 


ToLr 1 \yoi\ 1 A I CdAO i \d I Ovj I A 


147 


TGCTCTGCACTGCTGAOG 1 A 


148 


TCAOOAGOOAGAO 1 G 1 G 1 AG 


149 


AGGAGCAACOATOA 1 GCAOG 


150 


GGGCATACCTATCCCGAGAT 


151 


CGGGCGATACCACTCAGATT 


152 


AGCGGCAACCAGACATACGT 


153" 


CACGCCAT ACCAAGGAGAGT 


154 


CAGTGCATACCAAGCGACGA 


155 


CAGGCAGTACACAATCTACG 


156 


TACGTCGCATCCATAGCTGA 


157 


GAGTGACACCTCAGCAGATA 


158 


CTACAGCACCTCAGGAGAGT 


159 


CTCACGAGATCCAGGAGTAT 


160 


CCAGGACGACAGAGAGATGT 


161 


CGCACACACCTGAGAGAGAT 


162 


GCGCACGCACTCAGATGTAA 


163 


AGACGCTCAACCACGAGAGT 


A A 

164 


A/^/^^/^A/^A /^T/^ A /^T A A A 

GACGCCACAGTCACTAGAGA 


165 


GGCGCACACTGTACTGAGAT 


166 


OGAAGOGOOAG 1 AOOAGA 1 A 


167 


GGGTGGCTAGCTACTGTGAT 


168 


A A A T/^ A T/^T A^/>A*^TA/^ 

GAGACATGATCTACCAGTAC 


169 


GGACGC TTACTCAGCAGTCA 


1 70 


CGGGTGTTACAGAGCTATCA 


171 


CGGGGCTTACACAGACATTA 


A 

172 


/-\ /— \ ^ A 1""T" A A A 1 1 A A 

CGGAGCTTACACATTAGCAG 


173 


✓^T^ A /^y** ATA ^ A /^TT^ A ^ A T ■ 

CTGAGCATACACTTCACGAT 


174 


CCGATCATAACTGTAGATGC 


175 


y^ y^ y^ y~» y^ A ~i~ A A ^^^n/~V y^ ■ . I y"^ A y^ A 

CCGCCGATAACTGCTTGAGA 


176 


A A T A ^^^^ A A T/^T^ A A 

GGCCATATACGAGATGTAGA 


177 


CGTCCCTTAACGGCTGGTAT 


178 


ATACCCAGAACGAGTATGCG 


179 


A T^y^/^ A r^r^ a a /^/^ A t/^ a a T/^T 

ATCCCAGGAACGATGAATCT 


180 


A T/^/^/^/^ A ^> A A Z**/^/^/^/^/^ A^^ A A 

ATCCGCAGAACCGGCGATAA 


181 


GCTCGCCGAAGCGTGTTTAA 


182 


r^f^r^f-^r^r^/^r^r* i\ A r>T/^TT a t a 

GCGCCGCACAGAGTCTTATA 


183 


CGCGCTGCACAGAGCATATA 


184 


y^ y-N y-\ A A A A A T A T 

CCGGTGACACAGGCAGATAT 


185 


y^ /^/^T A T/^ A /^/^ A /~v ~l~/^~r* A T A T 

GCGTATGACCAGGTGTATAT 


loo 


Cj ICa 1 A 1 GAAuCa 1 oO 1 o 1 AO 1 


187 


GTTTCGCACGAGGATGTATC 


188 


GTGCTCGCAGAGGATTTATC 


189 


TAGGCCAGAGTAGCGACTTA 


190 


CAGATCCTAAGAGCAGTTAC 


191 


TAGATGGTAGGAGCGATTCA 


192 


TAAGTCGGTGGAGCATATCA 


193 


taagcgcgtggactcgtaaa 
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oeq. lu 


o lO 0 sequence 


iy4 


1 AAVj I vjoAU 1 VjAoLroUA 1 A 1 




1 A 1 AL^otjjOAo I vjvjA I UAoA 1 




Lr 1 A 1 ALfCjUAA 1 LpOAO I OALpA 


iy 


O 1 A 1 Uo 1 OAAo 1 IjA I ooAUO 


H fib 

198 


T AT Af^ A^T Ar^r^Ti^ ATf^r^ Af^r* 

1 A I AoAU 1 AvjU I CjA I LrtjAvjV 


199 


XA/~*XAr^/^ A/^X/^/^/^O AXr^ A A A 

1 Avj 1 AOoAC? 1 (jVjQjOA I OAAA 


ZOO 


X A A/^/^x A /^x/^ A r^r^ A X/^ A r^T 
T AG ACGTAGTCj AGO A T G AO T 


201 


TGACGAGTTAGGATCTATGC 


202 


TTACG AG T GTAG CGTCCATG 


203 


TCGTCGTAGCATCTCGCAGT 


204 


TCGACGTAGGATCGCAGTAC 


205 


TCAGTATCATGGAGTACGAG 


206 


TGCACTAGATGGGATCGACT 


207 


TGCGATTACTGCCGTCACGT 


20o 


1 GGAO 1 U 1 A 1 GGCAGOOG 1 A 


209 


TGACAGOAG i i GOAG i COGT 


Z1U 


1 AOAOAGoU 1 1 GUAGO 1 oGA 


21 1 


Tf^r^ Ar^r^/^r^ A/^x/^/^/^xi^ axx a 
1 GUAGGGGAG 1 GOG 1 GA 1 1 A 




/^V^/^r* A /^/^/^ A A X/^/^ A X A X/^ A 

GOGOAGvjGAvjA 1 GGA 1 A 1 GA 


213 


r^r^ f^r^ A/^r^f^ A A f^Tr^r^ Ar^T AT A 
GGGGAGGGAAG 1 GGAG 1 A 1 A 


214 


Ar^r^r^/^r^r^ a a/^ a/^/^x/^xaxa 
GAGGGGGGAAGAGG 1 G I A I A 


215 


X/^/*/*^X/^ ^ A X a A X A X/^ 

G 1 GOG 1 GOA 1 AGGGA I AG i G 


OH C 

21b 


xi^r^r*x/^r*o Ar^ Af^r^r^Tr^T ATT 
I GGG 1 GGGAGAGGG 1 G 1 A 1 1 


21 / 


1 GGGA 1 GGAGAGGGG 1 1 G 1 A 


218 


GGAGGAGGAGAGG 1 1 A 1 A i G 


OH r>' 

219 


<^/^l^/^/^ A X/^ A /^/"* A/^/^XXXA /^ A 

GGGGGA 1 GAGGAGG 1 1 I AGA 


OOA 

220 


/^-p/~v/-^ /~»/^ AXAi^A/^/^A T — T/^X/^/^ 

GTGGCGATAGAGOAI 1 CTGG 


OO H 

221 


AACGCGAGAAACOATTTGGG 


222 


A/^/^^A/~^A^AA r^Tr^ A AT/^i^f^ 

AGGCAGACAACTGAATGGGG 


22o 


Ar^/^Af^Af^i^A A r^r^T ac^ a r^Tr^r^ 
AGGAGAGGAAGG 1 AGAG 1 GG 


oo^ 

224 


AGCCAAGGAACCTACATGGG 


225 


/^ir^ f^ A A a r^/^T^/~^ a a t/^ a a 

CCGCAAGCACGTCGAATGAA 


ooc 

226 


/^/**/^/^ A Tr^r^ AOOA/*^A A Af^r^T A 

GGGGA 1 GGAGGAGAAAGG ■ A 


227 


r^f^/^ Af^/^Af^A f^r^T Ar^ AX A XX A 

G CC AG G AG ACGTAG ATATTA 


ooo 

228 


r^r^r^r^ AT A^ A A A Axr^ ax/^ 
GGGGA 1 AGAGAGAGA 1 OA 1 G 


OO A 

229 


X/^ ^X A X A X/^ <^X A A XX^r^ 

1 GG 1 A 1 A I OGG 1 AGA 1 1 OGG 


230 


/-^ Al^r*X AXA A/^r^X/^/^ AXX/^ A/^ 

GAGO 1 A 1 AAGG 1 GGA 1 1 OAO 


231 


A X A A /^XX/^ A XX/^ A 

OGOGGA 1 AAO 1 1 GA I 1 GAGO 


232 


r^T r^r^ r^TT A r^T AT A A 
G I OGGO 1 1 AGO 1 GA 1 AGGGA 


ooo 

233 


A r^~r AT A A T ^ f^T AT 

GGAGCTATAOA 1 GOO lA IOC 




r^Tr^r^c^r^T AC^ ATCir^Tc^r^T at 


235 


TCGGCTTGACGTGCTCGTAT 


236 


GGGCTGTGACTAGACTCTCA 


237 


GCGAATTTAGTAGACGCACA 


238 


GAATCTCGAATAGCGGTACA 


239 


GACAGTTGACATGACAGTAG 


240 


GACATTGACATGGCATAGAC 


241 


GAGTTTAGAATGGTGAGCAC 
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beq. 10 


o to o sequence 


2A2. 


O 1 A 1 1 OCjOAACj I o 1 OoAoOU 




o M A 1 oVjAoAU I oL/ 1 LrVjAOO 


244 


AOOCj I I U I AAA 1 uOvj 1 UAOA 


<^4p 


UUoA 1 A 1 UAAO 1 o 1 UAU 1 AU 


24d 


/^/^/^O A AXr^ A A/^X/^XAO AXA/^ 

UCjLfCjAA 1 (jAA(j I O i AOA 1 AO 


247 


CC ACTATG AAG CG ATATAG G 


248 


^ A /^/^ A /^T/^ A A^A/^AXA /^/^/^/^ 

CACGAGTGAAGAGATACCGC 


249 


GCACTGTTACATGATACCTC 


250 


GCCAGTTACAGTCATGCCTA 


251 


GCGCAGGTAGATCCACTGAT 


252 


GCGTGCGGAGACCTCA 1 1 I A 


253 


GCTCACGAGGCACGG 1 1 lAT 


254 


GCGCCAGTAGCACGCTTATT 


255 


/^T/^ A /^X A A /^T/^ A X^ A X 

G GCTC AGTAG G ACTG ATGAT 


256 


A i^T — O A O A f~^r^ A A A X A /^/^X 

AGTTGCACAGCACAATAGGT 


257 


/~»/^ A ~r A A ^ A O A X A XX A 

CGCCATACAGGACGATATTA 


25o 


GGGGAGAGAGGAGGAG 1 A M 


259 


CCAAGGAGAGTACAGGAl GT 




GGAGAGG 1 AGG 1 GGAGG 1 A 1 




/^Xr^A A/^ AX/^/^XA/^Or^XX/^ AO 

G 1 GAAGA 1 GG 1 AOGG 1 1 GAU 




OO AX AXO A A OOXO A OXO A AO 

GGA 1 A 1 GAAGG 1 GAG 1 GAAO 


ICQ 


OOXAXO A AOOXAXOOO A AO A 

GG 1 A 1 oAAGG 1 A I GGGAAGA 


2p4 


OXXAXO AO AOO AXOOO AO AO 

CT 1 AT GAG AGG A 1 GGGAGAG 


265 


OOOOXOO A AOO AXXXO AO A A 

GGGG 1 GGAAGGAT 1 1 GAGAA 


266 


r^~^~r N a oxxox a axo a 
GGGGGG I 1 AAG 1 I G i AA 1 GA 


267 


r'^'Tf^r^ a /^/^ at^ at a ^/^t — tat 

GGTGGAGGATGATAGGTTAT 


268 


/^T/^/^T A y^^A/^/^ATA /^/^T — TAT 

GTGGTAGCAGGATAGCTTAT 


269 


T A /^^/^T/^/^ A r^r^ A T/^/^T/^ A /^T 

TAGCGTGGAGGATGGTCAGT 


270 


A A <^ <^ y^ T<^ A/^/^A A<^TA T/^ A 

CAACGGTGAGCAACTATCAG 


271 


/^T/*^y^T — r/^y^ A /^y^ A A T/^X AXi^ A 

CTGG TTGGAGGAATGTATGA 


272 


TCGGGTC T AGGATGGTCTAC 


273 


TCGATGCACTGAlGTGAGTA 


274 


T/^/^T ATA Ty^/^/^ A Xy^/^/^ A T/^T 

TGGTATATGCGATGGGATCT 


275 


T A /*^/^y^T/^/^ A /^/^ A T/^ a /^y^T — TA 

TACGGTCCAGCATCAGCTTA 


2/D 


AXO aOXOO A AOOTAO AO AXO 

A 1 GAG 1 GGAAGG 1 AGAGA 1 G 


111 


AXO A A OXO a a OOTO A X A ooo 

A 1 GAAG 1 GAAGG 1 GA I AGGG 


27o 


XAOXXOXO AOO AOOO AOOXA 

1 AG 1 1 G 1 GAGGAGGGAGG 1 A 


279 


T AGTT ATG AG GAG G G G T G G A 


280 


OXX/^XO A O AXO A /^OO A OO AX 

GTTGTGAGATGAGGGAGGAT 


281 


O A /^/^/^ A y^ A A O A y^O A A XOX 

GAGGGAGGAAGAGGAGATGT 




OAOOrSOTO A Ar^Ar^OO ATAO A 


283 


CAGGAGTTAATAGCTCATCC 


284 


TAAGATTAGTTAGCAGCGCC 


285 


GAGTGATTAGCAGACGCCAC 


286 


CGATGATTACCAATGCCACG 


287 


GACTGATTAGCACATCCACA 


288 


GATTATGTAGCACTATGCCC 


289 


GCTATATTACGAGCTATGCG 
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Seq. Id 


3* to 5' sequence 


290 


GTTTATATCGAGGCAGGCCA 


291 


GTTACTATCCGATCAGAGCG 


292 


CGTCATGTACCATCAAGTCG 


293 


GTTATCTACGGATCATGCGA 


294 


CTGCCGTAAGTCTCATGCGA 


295 


CTAGCCGAATACTGCATACA 


296 


CTGCC3TCGAGAATCGCGTTA 


297 


CATACAGGACAATAGGTTCG 


298 


GATACCGACTGATACATTGC 


299 


GATACCGCACGATCAGCAGA 


300 


GTAtATGCAGACTACTGGAG 


301 


TATAGTCGATTATCCCAGCC 


302 


CATAGTACAATATCCCGACG 


303 


CTTGACAGGTACTACCAGTG 


304 


CTGAGACAGCTATCGACACA 


305 


GTGAGTAAGTCTTCCACACG 


306 


TCGGATAtAGTATGGGTCAG 


307 


CGTAGGATAGAATGCACAGT 


308 


CATGATACACACTCACGAGG 


309 


CGGAATCACGACTACATACG 


310 


y^ ^^^^ A ■T",^^ A y^y^ A y^^^y^ A y^ ,^^*T^y*^ A 

GGGTATCACGAGTCACCTCA 


311 


GAGAGAATCGTATGACAGCC 


312 


GAGTATGTAATCTACGTGCG 


313 


GAGTAATCATAGTAGGAGCG 


314 


GACTATATCCAGCACGGAGG 


315 


GACATATAGCTCCACTCAGA 


316 


TAGACCTAGTtGCAGCGCGA 


317 


TACTAGACGTTTCACGGCAG 


318 


GTACATATCTGTCACGGGCA 


319 


TAGTATATCCTACGCGGCTA 


320 


GAGTATATCGCAATGCCAGC 


321 


GAGTTGTCAGATAGGCGACC 


322 


GACGCATGACATATTCCTAC 


323 


GAGACACTTGACAGTAGCCA 


324 


GGCTAGTTACTCAGATCACA 


325 


CGCAATAAGTCTAGCTCACT 


326 


CATGTACTAAGCAGTCACAC 


327 


CTAGtTAATGTGAATCCGGC 


328 


GACTGTGTAATCATTGCAGC 


329 


CGTTCGTGAATCAGCACAGC 


330 


ATTGGGTCACACAGCACAGA 


331 


ATCTGCTGACACACACTAAG 


332 


AGCTGGCTAAATATGTAGGC 


333 


ACTGTCGCAAATATCACACG 


334 


ACTGTCTGAGCAACCAATAG 


335 


GTTACTAGCTGGACGTCAGA 


336 


TTATAGACTGGTGCGGAACA 


337 


TTAGCATACTGTGCGCGAAC 
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3' to 5' senuence 


338 


TGTGCTGACTTAGGTCGAAT 


339 


TCTCGGGACGTTGCGCTATA 




rrGTCCGCGACGTTGGCTATA 


341 


TGTTCGTGACTGTGCGCTAC 




TGTCAGGTAGTGGTCGCTAC 




TTGATGTACTGTGGCTACCG 




1 n AC' rAGAGTHGOGrATGA 

1 1 1 1 r\\jF-\\j 1 vjVJWwwrA i vJJr^ 




TTAGATAGATGTTCGGCCAG 




CTG AAT AG ATT AT AG GCGCG 


^^47 

OH f- 


TnGAATnGnTGTTAnGGAAA 


OHO 


TrAGAPTAnGriTAGrGnATA 


'^AQ 

OH<j 


TGAGGAGTATGTAGGCAGTA 


oou 


TAAGGrGGGTrAGGPTATTT 


OO I 


TATGACCGATGTGGAGGTAT 

1 ry 1 \jrAVy\-/VjrA i vj i \j\^r\\j\j i rA i 


oo^ 


TTAGGAGGGTGGGGGATGTT 




TTCACACGGTCTGCGAGCTT 


00*r 


CTTCAGACAGGAGGAGATAT 


0\J\J 


TCCAGCCGACGTGCGATTTA 


356 


TCCAGCGTACCTGCTTGTAG 


357 


CTCCAGTCAAGTGCTTCGAG 




CTCCAGCGAAGTGATGAGAA 


359 


TGTCAGCGGATCGCCATATA 


3fin 


TCCATGCGAGGATCAGGTAT 


361 


TGCAAGCAGTTCTCAGCGTA 




TGTAGGACCTGTGGTCACTG 


0\}0 


TTTATnGGAGTGCTCAGGCT 


'^R4 


T ATGTG AG C AG G CGC AG CTT 


O^JyJ 


TTrTCGTAGCTGCGCrTAGT 




TATTPnAGPTAGGGACGCAT 


OU r 


TATTTATAGTGGGAGCGAGG 

1 i \ 1 1 1 rA 1 AAV-/ 1 VJV^VJAAVJWVjrAVj 




GAPnTTAGACTGGCACGAGA 




TAPTGATAGCATGGGAGGTT 

1 rAw 1 V3rA 1 rAOOrA I V_JOOAAwV-? 1 1 


O 1 \J 


TGGGATAGCAGTGCGCTCTA 


371 


GCTGATGCACGAGGCCATTA 


379 


GCTGGATCACGAGGCTCATA 


373 


CGCTTTGTACCAGGCCATAG 

\^\J\y III \J 1 AAV/ VyAAVJVJ w V/AA 1 AAVJ 


374 


cgtgattgaccagAcccagt 

V^V7 1 VJAA 1 I VJAAVAVi'AAVJAAwV^V^AAVJ I 




TAOGrTGGATGAGACGGTCA 

1 AAV/VUV/ 1 VJVJAA 1 V/AAvi3AAw VJ VJ 1 V/AA 


376 


ATCCTGAACGCAGAGACACG 


377 


ATCGTTGCACCAGAACTACA 


378 


CTCTCAGGACCAGCATGATA 


379 


TCTGAGCGATCTGCCAGTCA 


380 


GGTGAGACCTATGTATATCG 


381 


TTAGAGTCTTAGGCATGTCG 


382 


TTATAGCCGTAGGCAGGTAC 


383 


CTCTAAGTATTGGACACGCA 


384 


GGTAGGATATAGGACACTGA 


385 


GCTATCGAATGTGCAGTACG 
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5^pn IH 


to B' ^poijpncp 


\J\J\J 


TCTATCCACTGCGGACGAGT 




TC AT ACTC ATGTG C AG CTCT 




TnATCGAGATCGGCCACTGT 




nTTATGATACCAGTCAGCAC 




T ATTG GT AC G G AGTT AG n C n 


Ov7 1 


GTAGATGAPPPAGTTPrAGP 




ririPTGTTAPPGAGTPTPAGA 




TGPTAf^TTAGGAGTATPGPA 




f^GPTTAPTAGPAnTPAPGPA 






oyo 


P;GPATPATAr^APnPTAPGPT 




PAHTPAr^PAATPGPAnPTAA 


ooO 


r:i A TP A r^T A ATHPP; A Pi P A A P 

VJM 1 OMVJ 1 MM I OOwOMOOMMO 




TATPATAf^ATriPr5(^APrir^AT 
1 M 1 L/M 1 MOM 1 VjOvDVjMOOOM 1 




PAPiTPPAPAAGPGPGAGTAA 

OMvJ 1 \j'OMwMMv30000M'^ • MM 




PGTAGPPPAAGTGPPGATAT 




GAPGPAPPAPAGGPTAGTAT 

VJMVi»V3V/MwV-^MvyMV3VJ w 1 MV3 1 M 1 




PTAGPATAPPAGGPGAGAGT 


404 


AGTGPATPAPAAGAGAPTCG 


40S 


GCCATAGACGAGGCAGTATC 




GGAATAPGnTGAGATATACG 

V3V-JMM 1 MVVJV^ 1 wMVJM 1 1 rAwVJ * 


407 


GTTAATnGPTPAGPAGPATT 


4nR 


PAPAAGPGAPPAGAAGPGTT 




TPTTATPGAPPAGGGPGGTT 

1 O 1 1 M 1 V_/ AA W AA Vjj V_J VJ W V_3 V_J 1 1 


4in 


GAPAPTATPPPARAPGPAGT 

OMOMO 1 M 1 OwL/MOMOOwMVj 1 


4.1 1 


TTAPTAnGTTPAGPGPGATP 

1 1 Mw 1 MOO 1 1 vxMOvyOvvS'* ' ^ 




TTPAGATPPTPAGPGTAGTP 

1 1 \-*MOM 1 \^\^ 1 V^MOWO 1 MO 1 O 


41^^ 


TPTPAriATATTPGTAGPAGP 

t I OMOM 1 M 1 1 OO 1 MO^^MOO 


414 
4 1 4 


Tr^TPTATTAf^TAnPTf^Pr^Ar^ 
1 O 1 O 1 M 1 1 MO 1 MOv^ 1 OoOMO 


41 ^ 
4 1 0 


TAr^ ATAPTPTGAGPTAGGAG 

1 MOM 1 MO 1 O 1 OMOO 1 MOOMO 


41R 
4 to 


TGTPTPiTAGATPGTGPGAGT 

1 O 1 vy 1 O wMOM 1 vxO 1 OOOMO 1 


417 
4 1 f 


TTPGGTPTAGPTGnTAGPAT 

1 1 V_^00 1 v> 1 MOO 1 OO 1 MOOM 1 


41R 
4 1 O 


ATPTGf^PriAAPAGr^TGPATA 

M 1 O 1 OOOOrAMwMOO 1 OwM 1 M 


41Q 

4 1 1/ 


AATGPGPGAAAPGGPGATAP 

MM 1 OOOOOMMMOOOOOM 1 MO 


490 


TTTGTPGPAGTAGTPGPATP 

1 1 1 O 1 OOOMO 1 MO 1 OOOrA 1 O 


491 

4^ I 


TGTTGTG P AGTPTP P AGG P A 

1 O 1 1 O 1 OOMO 1 w 1 OOMOOOM 


499 


PATTGTGAAPTPTAPGTPAG 

OM 1 1 O 1 OMMO 1 O ' MOO 1 OMO 


49^^ 

4^0 


PGGATGTPAAGGTPTPAGAG 

OOOM 1 O 1 OMMOO 1 O I OMOMO 


494 

4^4 


PTGPGGPAATAPTPTPAGGT 

O 1 OOOOOMM 1 MO 1 O 1 OMOO 1 


49*^ 


ATGPGGAGAAPPTPTGAPAA 

AA 1 OOOOMOMMOO 1 O 1 wAAWAAAA 


426 


GCGCGTGAATCCTGTGACTA 


427 


GCGCTCTGAATCTGTGAGAA 


428 


GCGCTATGAATGTCAGCTAA 


429 


GCCGAGGTAATGTGATATAC 


430 


GCCGCGTGAATATGAAGATA 


431 


GCGGCGAGAATCTTCCGATA 


432 


GATGGTAGAATCTCTCTCAC 


433 


GCTGCGGGAGACTATCATCT 
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434 


GCTGGATTACGATGCCATAG 


435 


GTTGATTCACGATGGCAGAT 


436 


CTTCACGCAAGTTGTCCAGA 


437 


CTTACGCCAAGTTGTCAGAA 


438 


CTTGCGTCAATAGTCTGAGA 


439 


CCTGTGCGAAGTGTCTTACA 


440 


CTCAGTCCAAGTGGCTGAGA 


441 


CCATAGGGAAGCGCACAGTA 


442 


CGAGCAGTAAGGGCAGATAG 


443 


CTCCGCCTAAGTGGCAGTAA 


444 


TGCGCCTGACGTTCGGATTA 


445 


TGTCCAGTAGCTTGAGAGTC 


446 


GCTCACAGAGTTTGATAGAC 


447 


GCTACAGGAGTGGATATTAC 


448 


GTGACAGTGGCAGATATAAC 


449 


TCGCACTGAGCTGTAATCGA 


450 


TCTTATGAGATGTAGCTCGC 


451 


TCCATCTAGGTGTAGCCGAA 


452 


GTCATAGCAGCTTAGACCTA 


453 


TTATGCTGACTGTGCTCGAC 


454 


TTAGTGCAGTATTAGTCGCG 


455 


TGTCTGACCTTGTAGCCGAC 


456 


TGTTGACACTTGCGTACCGG 


457 


TCTTAGCATGTGCGACGACG 


458 


GCTAAGCTCTTGCACTGAGG 


459 


CATAAGACTTTCCAATCGCG 


460 


CTGAAGCACTTTCCACGAAG 


461 


ctgaacccgttgcagagaga 


462 


CGGAACCGATGGCACAATAT 


463 


GGTGACCGATGGCTACTCAT 


464 


ATGGCGCGAACCCTGTACTA 


465 


GATCGCGGAAGCCACGTATA 


466 


GACGGCAGAATGGAGTATAT 


467 


CGCGGAAGAAAGCATATTTG 


468 


CTGAAGGGGAGGGAATGTAG 


469 


T* A A A y^ y^ y^ A y^~r*y^~t" A 

TCACAGGAGGGTCGACTCTA 


470 


CGACAAGGGATTGAGAGTAG 


471 


ATAAAGGTCATGCCAACCGG 


472 


TATAATGGGTTTGAGGTCCC 


473 


TCTAATGGGTGAGACGAAAC 




1 VjAA I OUUlu 1 vjAU I OVj 1 AAA 


475 


GTGGAGGCAGTGCATCATAA 


476 


GTGGTGTGACCTCGCCATTA 


477 


GGAGATGCACTACGGACTAT 


478 


GAGGATCGAATACTGTCGTA 


479 


CGGAGAGCAAGTCATACGAC 


480 


GCAGGAGAGGGAGTATACTA 


481 


GAGCGTGTAATCCGATCTAA 
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482 


CGATACGGAAGGCGCACTAA 


483 


CGATAGGTAAGGCGACTCAA 


484 


GATGTGGCACGACGATGATA 


485 


TGAGTAGGCAGTCCGATCTA 


486 


TGATAGGCAGTGAGTTCATC 


487 


TTATGGGGAGAGTTGTCATC 


488 


GTTTAGGCACGATGCTGTAT 


489 


GCGTTAGGACCATAGTCTAC 


490 


CCGATGCGACAATACGTTAG 


491 


[TeTAGCGTCCCATAGCGTAG 


492 


pTGTCTGGACCATAGCAGGA 


493 


CTGCTTGCACGATGAGCGAA 


494 


Tagggcggacgatgtagtca 


495 


CCGCTACAAGCATTGGGAAT 


496 


cgggtagaagaatgaatgct 


497 


CCGATGATAAGCTAGTATGC 


498 


GCGGATAGACCATTATTGAC 


499 


GCCAGtAGACCATCGGTGAT 


500 


gcacgcggaccatcgtttat 


501 


GCCGCTCGACCATAGTGATA 


502 


gccgagtcaccatgctgtat 


503 


CACGGGTCACCAAGGGTATT 


504 


GACGGCGACCCAGGTTATAT 


505 


TGTGCGTCAGCAGTTAGTAT 


506 


GCTCGGCTACCAGtCGTTAT 


507 


CGCTGGACACCAGTGTGATA 


508 


CGGTGGAGACCAGATTATAT 


509 


CGCGGGACACCAGCATATTA 


510 


GCTCGCGCATTAGCATATAA 


511 


GCTGACATCCACGCATTGAG 


512 


CGCTGATCCACCGAGATTAG 


513 


ACGCAACCAACAGCGAGTGT 


514 


CACAGACCACAAGCTATGGG 


515 


CCTAGCCGAAGGCATTAGAA 


516 


CCGTAGCTCCAAGGCATGTA 


517 


CAGTGGGCCAGAGCAAGTAA 


518 


GAGGCACGACGAGTCATGTA 


519 


GGTCAGCACTCAGCGATGTA 


520 


GTGTGGCACTAGGCCGATTT 


521 


GGAGACCCGTAGGCATAATT 


522 


cggtgtaaggatgctgaata 


523 


GTCGTGCAGGATGGCATATT 


524 


GTTGCGCACGATGGCAGATT 


525 


GCTGQGACCATCGTCAGATA 


526 


GTCTAGCGATCATGeTCAAT 


527 


CTCTAGGAATCATGCGGAAG 


528 


CTTAGATACTACGAGCACGA 


529 


GfGiACGCTACGTGAGCCTAA 
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530 


TACCGTGTACGTGAGCGCAT 


531 


TACTGCGACGTAGCGAGTCA 


532 


TACTAGGTACTCGCGGCACT 


533 


TACTGCGTACTCGGAGCATA 


534 


GCtCACGTACTCGACAGAAA 


535 


GTGTACTATGTAGCGAGATC 


536 


TAGTAGTACGCTGTCAGAGC 


537 


TGTCGTCGAGTCGTAGAtAC 


538 


GTAGTACACGGAGTGATCCT 


539 


GTAGTACGAGCTGAGACTCT 


540 


GTGACTAGCTCGTAATTCTG 


541 


GAGACACGGTACTAGAGACT 


542 


CAAGAGCGTCACAGAGATGG 


543 


CTATGAGACCACCTCGATAT 


544 


ATTGGGCGACAACGCATTTA 


545 


GTTGCGGTACTAGGGATACT 


546 


GGCGCAGTACGATTGACTAT 


547 


GTGCGACGAGCTTGTCACTA 


548 


tgcgtgtgactattgatacg 


549 


GGTCTGGGAACTTTGCTACG 


550 


CTGTAGCGAAGTTCTCATAC 


551 


TCGGCGTTACGTGGTGACTA 


552 


TGAGCTATACTCGTCGTCAG 


553 


CGGATACTAAGCGTtACGAA 


554 


CGTCATACATAGGACTAGCA 


555 


CGCACGCTACAGACTATTAT 


556 


GCGAGCGTACTATACATAAC 


557 


GGGAGTCTACGACCTCTATA 


558 


CGGTACGCACGACAGTCATA 


559 


gggtacatacgactatacag 


560 


GGCTAGATACACCACTGATA 


561 


CTCTAGGTACACTACTGCAT 


562 


CGTCAGAGACACTGGAATAG 


563 


CTGCGCGTACACTCGGATAT 


564 


CTGTCGGTACACTCGTGAGA 


565 


GTAGACGCCTAGTCAGATAG 


566 


GAGGGACTACGAGCCACTAT 


567 


GTGCGACTACGTGCATCACT 


568 


CGTAGGACACGAGCGTATAT 


569 


GGCGACGACGTGACTATAGT 


570 


CGGtCACGACGACGAGATAT 


571 


GCGTCACACGAGGGGATATT 


572 


GTCGCTCACGATGCGGATTT 


573 


GACGGACAGAtCGTGACATC 


574 


GACCACGTACATGAGCTGAC 


575 


GGCGACGTAGATGATATTGT 


576 


GAGACTGTAATCGCATATGC 


577 


GACTATGTAATCGAGCOTAC 
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578 


G ATAGTCG AATCG CGC3 ATAA 


579 


TATA /^/^ A /^"T"/^ f^f^ /~^~X~ A A 

TATACG G ACTGCG CCCTAG A 


580 


TAGTCTAGCTGAGCCATCGA 


581 


GTATATGACCTAGTGGCACG 


582 


GTGTTGTAGGATGTGCTGCA 


583 


GAGTCTGACATAGGGCACCT 


584 


GAGTTGCACGTAGACGATAC 


585 


GACTGGCGCATAGACACATG 


586 


GAGAGGCTACGAGACTAGAT 


587 


GTGACGGCACTAGCAATATA 


588 


PTGCTGTGACACGCGAGTAT 


589 


CGGCTGTGACACGAGCTATT 


590 


CTGGTGCGACACGCCTATAT 


591 


GTCAGTGGACTAGCCCTACA 


592 


ATCGAGTCAACCGGCCTAGA 


593 


TCGATAGCCTACGTGCCGTT 


594 


GGAGACCTGTACGCACTGTT 


595 


GCGTGACAGCTCGCACTATA 


596 


GCGTAGGTCAGCGAGATTAA 


597 


GCTATACGCACCGTCATGTA 


598 


CGCATACACTCAGCAGAGAT 


599 


CTACTTACAGCAGCGACGAG 


600 


ATCTCGACAGAAGCTAATCG 


601 


A "TA^y^r^ ATA /"k A /N/^^ ATA ^ A ^ 

CATCGGATACACGCATACAG 


602 


ACATAGAACACCGCTTAGGG 


603 


TACTGAGTCCACGCTCGGTA 


604 


y^ A A A ^N*^ A A^ A A^ A% A 

GATACAGCCTACGACCGGAT 


605 


y^ A A A ^^^«- A ^^^^ A y^ A ^\ 

GATACATTACTCGACACGCG 


606 


y^ ^^"^^ A A^ A y^ A y^ A •^y^ y% A A A^ A 

CGCTACAGAGATGCACAGAG 


607 


CGGACTGTAACTGCGATGAA 


608 


GGTGTTATACGTGCATAGCC 


609 


CTCGTATTAAGTGCGGTACC 


610 


TATAGTATCGAGGAGCGACC 


611 


GTATAGTACGTGATAGGCTC 


612 


^^T" A y^ x^ A A y^ y^^nx^ a XNTn A x> a x> 

GTACGATACGTGACTAGAGC 


613 


GTAGGTCGAGCTGCATACTG 


614 


1 1 A y^ a y~*^n a y^Ty^~r"/~v a ~r*y^X^y~»"i" 

TTACAGTAGTCTGCATCCGT 


615 


CTAGTCAAGTCTGCATACAG 


616 


y^T'y^ T'y**T~ A A 1" A y^ A A A 

CTGTCTAATAG G G C GAG ATA 


617 


CTCGGAATACGTGTACCGTG 


618 


TuCGA \ G 1 AuG i GAGvjG I GA 


619 


TCTCGCCGACGTGGTCTTAA 


620 


TCTGTCCACGTGGCGGTTAT 


621 


TGGTGCTGACTCGCTGGTAA 


622 


GTCCCTAGACTCGCAGTGAT 


623 


GCGACAGTAGCTGCAATGAT 


624 


GACGTAATATCGCCACATCA 


625 


GACGAGGTACAGCGCATACA 
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beq. 10 


0 10 0 sec|uence 




oOAovj 1 0 1 AOoAUoOA 1 VjA I 




oLrAvjAo 1 AUov^ AUoUA 1 A I U 




vjAvj ! A(jA I AUAoo 1 OALrVjA 1 




0 AOOO ATO AO AOOTOOO ATT" 
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AO 1 1 oAAA 1 oAoAAoOOOo 1 


7r»'i 
/U 1 


PAPP AP A APPP A ATAPPPTP 
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714 


PAPAATAPAATArtPPTPCiCn 


715 


CAATAAGACATAGGCCGCCG 


716 


CACAACGGATTAGAAGCGCG 


717 


GACATGATATGAGAATGCGC 


718 
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CAAACACGAAGAGCCTGTCG 
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TGTTGATCCATGTCCATAGC 
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nr^PPTTr^r^ APTAPPGPTTAA 


R7A 
Of D 


Tf^PTTPl^APSf^fiTPPPAPTTA 


R77 
0/ # 


TriPPTfir^TAPTriTPPPAPTA 


P7P 
0 r 0 


T^PTTriT^Ar^Ar^TPPiPTAPT 


R7Q 
0 f ^ 


ATriPTTf^PA(^AAPPnTPAGP 


oou 


Tf^APTHTAGf^GAPPPTPAAP 

1 OMw 1 0 1 MVJ OOMVJ 00 1 OM/-\w 


00 1 


TPPTTnGPAGGATf^TPTTAA 


poz 


PGPTPPGGPATGAGTATATP 

00 w 1 w v>0 0 wM 1 \Dr\\J 1 M 1 M 1 w 


000 


TGPTTTGPAGTGAGGPTPTP 


PO*T 


PAATTTGnAAPTAGPPTTCG 


RR') 


TTTGPTGPATPPGGPPTGTA 


RRR 
oou 


TTGGGPPAPTGPGPTPTTTA 


RR7 
00 r 


tptgagpppttggpaPgtta 


RRR 
poo 


GGTGGPPPGATPAnATTPAA 


RRQ 

00>7 


GGPAHGGPAPPTPAGTTTAT 


Ron 


GGGTGGPPPATGPTATPTAA 

VJJV_J VJ 1 V_3\30 w wAA i VJJO 1 AA 1 W 1 AAAA 


RQi 

OC7 1 


PTPTG<^PPPTAPPTATf^GTT 


RQ9 


PPPiP:r^PAPAPPTPTPiATTTA 
oOv^o v3 wMOMOO 1 w 1 \J/-\ 1 1 1 M 


Oi7*3 


nPnnnpnPAPP ATTPATTAT 




r^r^APiPPPAPPATr^AnPTATA 


oyo 


r^AATPTPPAPPAf^f^Pf^riATA 
Oaaaa 1 0 1 OV-/MV-/V«fAAVjOV^VjiVjM 1 aA 


RQR 
opO 


AT AP TPf^PT AP A f^Tfi AT 

OOaa 1 AAV/O 1 vyOO 1 M\^MO 1 OM 1 


RQ7 


TPPTATAGPTPTATPHAPGG 


RQR 
09P 


PTAAPTAGPTHTAAnPnAPP 


RQQ 


APTAGATAAPAPiATGPGPPG 

Mw 1 MOaA 1 MMv>Mv3M 1 


yuu 


PAAPTATPATPAAGAPf^GPG 

VyMMvy 1 M 1 OM 1 OMMOMV/OOOVj 




PAAPAGAGATGAAGPnPGTP 

v^AAAA wMvjrAVJAA 1 OAAMVJVyVj w vj 1 w 




PAAPATATPATAAnCGCGTP 

V^/AAAAV-'M 1 AA 1 V-»AA 1 AAAAV_3 W VJ V»/ V3 1 W 


C7UO 


GPAfiATAGPATPATATAPGP 

0 v/MOM 1 aAOOaa 1 OM 1 M 1 aAVxOw 


QOJ. 


GPAr^ APTGAATTAGPTPTAP 

\D WMOMw 1 \Jr\r\ 1 1 MO\_/ 1 O 1 MO 


yuo 


PTTAATTPATPTAGPGPGAP 
0 1 1 MM 1 1 OM 1 0 1 MO 0 vJ w 0 M 0 




AGGAATCTAACCACGCGCAG 

AAVJVJAAAA 1 W 1 AVAVy V^A^W v-» WvJ vA^\— » 


907 


agaccaataagcaccctggg 


908 


AGACAAACATTCACGCCGGG 


909 


AGAATAAATTACTGCCCGGC 


910 


GAGCACATATTATTACGCCC 


911 


CAGAAGATAATATGCTCGCC 


912 


GAATAGCCGATAATGTCAGC 


913 


GAATAGCTTTAGACTGCCCT 
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06t]. lU 


"V fo C001 lonr'o 
O lU O op(^UciiU^ 


y i*t 


\Jr\r\ 1 V^/-\V^ 1 1 OMM i OMOWMV-' 


y 1 O 


f^(^ATPAPAPTr^PPr;r^APTAT 


y 1 D 


ntrXACOr^ATACXn ACTCTrXATT 
oVjMLrv-^OM 1 MoV^MW 1 1 \Jr\ 1 1 


Qi 7 


rXAm^^ATTAf^OAOr^Af^C'Tf^T 


y 1 o 


ri A TT A TP A A PTP A riT A 

OOA 1 1 A 1 OAoOMV^ 1 OMVj 1 


yi y 


^ r> A Tr*TP A A AT^i f^Tf^T 
oooA 1 U 1 LrAoAOoA 1 1 I 


Don 
y^u 


ooo 1 A I A 1 OAoOyiVjA I 1 A 


y^l 


oUAA 1 1 UoA 1 O 1 AA 1 1 


yzz 


AOUAA 1 oL»AAA 1 ALioOoUUO 


yzo 


APP A A AXTA AP APTXPPPPr* 




Pa a aV^ a ap^ Ar^ AxXxp/^r^r^P 


yzo 


XT A AXXPPPXP AXAXPPPPP 

1 lAAi I UUvj 1 oA I A 1 oUoUVj 


yzb 


PP AXPXA AXPPXXAXP APPP 
o(j>A 1 O 1 AA I olj 1 1 A I oAOL/o 


QOT 

yz / 


PP AXP A APXPf^XPXP A APXr^ 

oUA 1 voAA^o 1 I \j 1 UAAO 1 O 


yzo 


r^P 1 1 1 AAXPr^TPPXPAPPPP 


QOQ 

yzy 


P P XT A A A XTT A T/^ P A P: P 
1 1 AvpAA 1 1 1 Ao 1 oOAooO 


you 


rjPr^TPAr^AATTTATnPPAPA 
oUrO 1 *K^AoAA 1 1 1 A 1 oooAL^A 


yo 1 


PPTAP ATA ATTTAPPPP APP 
oL/ 1 AvjA I AA 1 1 1 AooL/L/Al^o 


yoz 


PPTPATAATPPT/^APPAPTA 
vjv-/ 1 oA 1 AA 1 vjO I OAvjoAV-/ 1 A 


you 


PPAPAATTPPATAPAPPPAP 
vjOAoAA I 1 oL/A 1 AoAOoOAO 


yo*f 


r^PATr^ATTAfiPATAPAPf^nA 
OOA 1 \Jr\ i 1 AooA 1 AVjAV^OoA 


yoo 


PPAPPA ATAPPA ATPAPPP/^ 
OOAvjOAA I AvjOAA I OAOovjVjj 


yoD 


ATTPPAPATTPAAPTPAPPP 


yo/ 


TfiPPA 1 1 1 APTTAfSTi^Pr^AP 
1 oooA 1 1 1 AO 1 1 AVj 1 oOoAO 


yoo 


P A APPP ATATPA ATPPTP AP 
oAAvjOL»A I A 1 OAA i Cjo 1 OAO 


QQQ 

yoy 


PPP APP A ATTTP ATPPP APT 
oL/oAoUAA 1 1 1 UA 1 oUL/AL/ 1 


y4U 


PPPPP A APTTTPTP AP ATP A 


y4i 


PPPP ATA ATPPTTPATAPTP 
iJoOLrA 1 AA 1 VJO 1 I vpA 1 AO I Lr 


y4z 


TTPPTPP ATPP ATPTPTPPP 
1 1 kjKd \ oUA 1 ooA 1 O 1 O I UOLr 


y4o 


TXXAP/^r^O AP^XT A/^PXXPP 


944 


XXAXPPP/~*OXA P APTP/^PXP 


y40 


XP AXP APPXPXXAPP APXAP 
1 1 VjAOO loll AoUAo 1 AO 


y4D 


PP APPAXPXPPXAPPP A A AX 
ooAOOA 1 o I oO 1 AOoOAAA 1 


y4/ 


PXPAPPAP ATXPAPPPAPAP 
o 1 OAIjOAvjA 1 \ OAoOOAoAo 


y4o 


P APAP APP ATPP APPPP ATA 
oAoAoAOOA 1 oOAoL/OoA 1 A 


y4y 


PPPTPPTP A ATPTTPPP APT 
oOo 1 Oo 1 OAA loll oOOAO 1 


Qc;n 

you 


PPPTTA ATPPPTPPPAPPTA 


yo 1 


PTPPTP aPattppppppatt 
o 1 oO 1 oAOA 1 1 OoOoOOA I 1 


yoz 


PPPTPTA ATP PTPPPPAP AT 
oOO 1 o 1 AA 1 Oo 1 oVjoOAOA 1 


ypo 


A ppPpPTP A A ATPP A P AT AP 
AoUoOo 1 oAAA 1 ooAOA 1 AO 


yoH" 


AppPiTPTr^AAATnPTATPAP 

AOOVJ I O 1 \Jr\r\r\ I Ow 1 r\ I wAO 


955 


AGTGCGCGAAATGTTCTACA 


956 


CGTCGCCAATATGATCGAAT 


957 


CGCCAGAAGTTCGAGCGATA 


958 


GGGCTACAGCGTGAGCTATA 


959 


TC5TCAGTGATCCGGGACTAT 


960 


GTTATCGCACCTGAGGCGTA 


961 


GTTGTGACCTCTGAGCACGT 
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oecj. IQ 


o lo 0 secjuence 




o M 1 OAUoL» 1 A 1 oUoAoUUA 


yoo 


ol 1 1 AuLrOO 1 u 1 L/UAovjvjjA 1 


yb4 


\ oUo 1 AUU 1 LrLr 1 VjUA I \J\D 1 1 


yob 


T/^ Ar*T Ar^r^r'^^Tr^Tr^r^r^ AT A/^r> 
1 (jALr 1 AuOvj 1 C3 1 UCjOA I AOo 


ybp 


1 UCjAL/ I AU(j 1 (j 1 Lr 1 UoA 1 Avj 


OCT 

ybf 


1 ACj I oA 1 AO 1 tjAu 1 LrA 1 VjoO 


968 


/^/^•T-/>T/-i A T/v A^/^/^r^ A /^T/~^T 

CGTCTGATACAGCCQAGTGT 


969 


GCCGTATCACGACGCTAGAT 


970 


A /^^XA^/^ A X A A A /^/^/^X A A 

AGCTCGATACAAOGCTAGAG 


971 


aTCTACTTAACGCGCTACAG 


972 


GACATCGTACCACTGCGTAG 


973 


/-N A /^T^/^X/^ A A /^X/^X/^X A ^ 

GAGTCGTGAGCACTGTGTAG 


974 


GACTCGGACCATATCTACGG 


975 


A /^X A A A A /^X A X/^X A /"^ 

CACTACGCAAGACTATGTAC 


976 


/-»/^ A/^Xr^X/^ A O A /^/^ A AX/^X A O 

0(j AG 1 G 1 GAGAGGAA i G 1 AG 


977 


r*^/^ AX/^XA/^/^ A/^/^/^ A AXAXA/^ 

OGA 1 G I AGGAGGGAA I A i AG 


97o 


A /^/N A j^/-k>^ A '/^/^ A ^ A ^X A A X 

GAGGAGGG AGGAGAG 1 AGA 1 


979 


r^/^'T A A.r^A/^r^/^Ar^r^r^A n a 
GG 1 AGAGAGGGAGGGAG 1 1 A 


9oU 


/^/^X AX/^^T"A/^/^ A /^/^/^ AXXAX 

GG 1 A 1 GG 1 AGGAOGG A i i A i 


yol 


r^c^Tf^r^c^ AT AC^r^ AC^ r^r^T AC^ AT 
GG 1 GGGA 1 AGGAGGG 1 AGA 1 


yy^ 


/^X/^ i^X A A ^ A 1^ A /^X A X 

G 1 GGG 1 AGAGGAGGGAG i A 1 


9oo 


r^Tf^r^Tr^r^T A A ir^r^r^ AT A r^T 
G I GG 1 GG 1 AGAGGGA i GAG i 


984 


r*T Af^ Ar^ AT Ar*r^Tc^r^ ac^ Ar^ ac^ 
G 1 AGAGA 1 AGG 1 GGAGAGAG 


98b 


r^T Af^r^r^r^ ac^ Af^r^r^ atc^ At^ at 
G 1 AGG GG AGAGGGA 1 GAGA 1 


985 


X A A r^T a r^T a r^T a 
1 AGAGGG I GGGAGGG f AG i A 


987 


c^r^r^r^T AC^ Ar*r^ AC^r^c^T at at 
GGGGG 1 AGAGGAGGG 1 A I A 1 


988 


/^X AX/^ A /^X A A A /^T A X 

GTATCACTAG G AGG AG GTAT 


989 


G I AG iGAGAG i GGGAGAGG i 


990 


CGACTACACAGCTCAGGATA 


991 


CACCGACAACTCGTAGaGAG 


992 


r^/^ A ^ A A ir^^ A A ^> A A X 

CG ACCCAC ACTAG gag AG at 


993 


A/^/^/^/^/^A/^A A/~*A/^/*^A*^A 1 I 

acgcgcacaacaggagactt 


994 


A /^X A A A aV^x/^ A A ^ /^x/~* 
AG I AGGAGAAG 1 GAGAGG f G 


995 


A i^X A A A A A O A ir* /^/^X 

AGTACAGCAACGCAGAGCCT 


996 


r^Tr^ A r^r^/^ AOi^/^x/^ a /^x a x t 
G IGAGGGAGGG 1 GAGG 1 AT F 


997 


t^T^ Af^r^r^ A r^T a a r^T a xo 
G 1 GAGGGAG 1 AGGAGG 1 A 1 G 


yyo 


T r^T r^f^T A r^T ^T A r^T A 
\ G 1 GGG 1 GAG 1 GG 1 GGAG 1 A 


yyy 


1 GGG 1 1 GAGG 1 GGGGA 1 G 1 A 


10UU 


I GG 1 1 1 AGG 1 G 1 GGGGG 1 GA 


1001 


Tf^Tf^Tr^Tr^ A i^TT r*' ac^t 
TGTGTG 1 CAGT \ GGGCGAG7 


n uUiC 


1 \^ 1 vjAoOAO 1 O I I 1 AVjVj 


1003 


GTTGATGACTCGCCACACGT 


1004 


CTGAGATCACAGCAGACTAG 


1005 


TTAGACTCCTCGCCGGTAGA 


1006 


TATAGCTCCTAGCAGGCGTA 


1007 


TATGCTCCACGTCTAGTGAG 


1008 


CTCTATCACCAGCGATGAGA 


1009 


CGCTCCAGACAGCATATAGA 
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Seq. Id 


3' to 5' sequence 


1010 


A AT A ^/^/^ AAA /^^T^T A /^/^/^ 

AC ATACCG AAAG CTCTAGCG 


1011 


ACATCGCTAAAGCACATCGG 


1012 


ATA T/^/^/^/^/^ A A T/^ A A /^/^ OTA 

ATATCGCGCAATCAACGCTA 


1013 


A T^^ /^/^ A <^T/^ A A /^^~*T A T~ 

CGATGCGCCACTCAAGGTAT 


1014 


"T A Ty^ /^/^^^ A A*/^ /^T/^ A OT A A 

TATGCCGACGGTCAGGCTAA 


1015 


TATCGCCACGTCCGGTGATT 


1016 


^ 1 r^y^^T*^ A OT A T^^ A T 

TCTCGCTCACTGCGTATGAT 


1017 


TATCCGTCACTCCGTAGAGG 


1018 


TATCGACTATCCCTGAGACG 


1019 


GTATAGACCTCTCAGACGCG 


1020 


CTATC GTAAT ATC AGTC C G C 


1021 


CGATGACAATTAGGTACACG 


1022 


GAGCATAATGACGTAGACCG 


1023 


GGACAATACTTGACAGCAGG 


1024 


CGATGATAATAGAGTAGCCG 


1025 


CTATGATTAAGTCGTAGCCG 


1026 


AGGTGAATAACGCATAGGCC 


1027 


A /^^r/~* A /^T A A T'/^ ^^T A A 

GAGTGAGTAATGCTACGTCA 


1028 


A T/^0» A A A T/^TT A A A 

GATCGACGAATGTTAGAGAC 


1029 


GACTCACGAATGCGGAGACT 


1030 


GACCGTCAATCGCGTCAGAT 


1031 


TACCCGCATGGACGGAGTTT 


1032 


^^T/~v A A /^T/^/^T/^ /^TTT A 

GTCAGCGCACTCCTGGTTTA 


1033 


TCAGGCCCACGTAGCGTTAT 


1034 


TTCGCGCTATCCATGCGTGA 


1035 


TGCTGATACTCGGCTGCATC 


1036 


A ,y^"^ A ^\ A ^^^^ ^^^ry^ A 1 1 ■ 

TGAGTAGCATCGGTGACTTC 


1037 


r 1 A ~f A y^ T^~\ ^^/^ y^~i~y^ y^ y^ y*\ a 

TTGTATCACTGTGGTGGCCA 


1038 


TTTAGTCAGTATGCTGGCGG 


1039 


TTAGGTTTATATGGCGGAGG 


1040 


TGAGATCACGTTCGCCGAGT 


1041 


y^ A ^r^y^ A ^^^^ a y^ ^^t^n ^\ a a 

GTATCATTAGGTCCGCAGAG 


1042 


TATCATGTAGACTCGGAGGG 


1043 


GTATGCTTAGATATGCAGCG 


1044 


TTGTAGTTAGCTCTGCACGG 


1045 


ATATCGTTAAGCCATACGCC 


1046 


ATTGTGATAAGGGTCTCGAG 


1047 


ATTCGTCCAACGCGGTCGAT 


1048 


ATATGCACAACGCGCAATCG 


1049 


TTAGCTCTATGGCAGTGCGA 


1050 


A TT A /^T/^ A A OOT/^/^ O A A 

ATTAGCTGAACGCCTCGCAA 


1051 


ATTATCTCAACGGAGGAGCA 


1052 


ATGTTGCTAACGGACGGACA 


1053 


ATGTGTTCAACGGAGACAGA 


1054 


CTCTTTCTAAGTGAGTCGAG 


1055 


CTGCTTGAAGTCGTCTCACG 


1056 


CTGCGTTGAAGTGGCTTACT 


1057 ■ 


GTGCGTTCACATGGCCGTAT 
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oeq. iu 


o 10 D sequsnce 




r* T A ^ r^r" nr* Ar*r*Tri A r'Tr* T AT 

1 AoOOoOMOO I oMU 1 o 1 A 1 


luoy 


\j \ AoUvpUOAoU I UAUVj MAI 


lUbU 


i^/^ r*r* o/^xr* A A "TO AT A ATT 
UVjjUUOo I OAUA 1 oA 1 AUA 1 1 


1 Uol 


r^r^T~rr^r^T Ar^r^ a to Ar^Tr^ Ar^T 


A ACJO 


o AACjoOuoo I AOALr 1 Lf 1 A 1 A 


lOoo 


CjAOAGCjVjOACAOoAO 1 G I a I 


1064 


TG CGCG G CACTCGTTCTAT A 


1065 


GCGGTTGCACTCGTAGCATA 


1066 


GAGGCGTGACCAGTCCATAT 


1067 


GGACGCTCACCAGTGCTTAT 


1068 


AGTGTCCAACCAGACCAGAG 


1069 


A T/^ A T A A A <^ A T A 

AGTG C C ATAC AAG CG C ATAG 


1070 


/^T A 1 1 A ^ A 1 1 /^/^/^ A A 

GTAGCCTTACATTGGCAGAG 


A f\'y A 

1071 


GTCGCCGCACATTCGGTTAT 


1072 


"1 r/^ A T/^ A A 1 1 A /~* A T/^ 

GTTGAGTCAGATTAGOAGTG 


1073 


1 OG 1 AGGGAO 1 GOGO 1 OA 1 A 


1074 


G 1 GAGA 1 oAGAGGGAGGGA 1 


1U/0 


G I G 1 GAooAGAoGGoAA 1 G 1 


lUr 6 


Ar^r^ ATf^ Ar^ A/^r^r* AC^ Af^ AC*' 
G 1 AGGA 1 G AG AG GG AG AG AG 


A n77 
\\jf f 


<^/^T/^ A ATTA/^ AT/^ A/^ A/^ A/^/^ 

GG • GAA 1 1 AGA 1 GAGAGAGG 


107o 


/^Ti^ A TT A T A ^TT/^ A T A iT* O 

G 1 GA 1 1 A 1 AGG 1 GA 1 AGGGG 


lUf y 


/^T A ATAT/^ ATO A A/^T/^/^r^O 

G 1 AA 1 A 1 GA I GAGAG 1 GGGG 


A f\Qf\ 

lOoO 


\ AG 1 1 A 1 GA 1 GAG ! GGGGAG 


lUol 


A Af^T ATr^r^Tr^ Ar^ Ar^T Af^r^r^ 
GAAG 1 A 1 GG 1 GAGAG 1 AGGG 


1082 


/~^r^ A 1 1 /^T/^ AOO A/^ AT A/^/^ A/^ 

GGATTG 1 GAGG AGA 1 AGGAG 


1083 


/^T A A T/^T/^ A /^/^ A f^f^ A A i^/^ A 

CTAA T CTGACCACGAGACGA 


1084 


C TG TATTG AC ATG AG AC GAG 


1085 


CTTCTCAGACATCGGACGAG 


1086 


A /^T/^T/^ A A TT A A A 

GCACTGTGAATTAGCGAGCA 


1087 


GCCTACGGAATTGGCAGACT 


A no O 

1088 


GACCTGGAATTAGGACACGG 


1089 


O ^T/^ /^/^ A" A 1 1 A /^/^/*^/^ A AT 

GCCTG CG AATTAGCG G AC AT 


1090 


/^/^ A T/^ /^T A A T/^ A T/^T/^T A 

GCGATGCTAATGATGTGTAC 


A \ A 

1091 


GCCCGTCTAATGAGTGGACA 


109il 


/^/^/^T A /^TO A TO AV^ A O A A 

GGG 1 AGG 1 GA 1 GAG AGG GAA 


A noo 


OO ATOO A O ATOOT A OO A O A A 

GGA 1 GGAGA 1 GG 1 AGGAGAA 


1094 


OOOOTOOO A AOOTOTO AT AX 

GGGG 1 oGGAAGG 1 G 1 GA 1 A 1 


A f\f\CZ 

1095 


/-N/^/^TOOOOO A TO A OTA O AX A 

GGG 1 GGGGGA 1 GAG i AGA 1 A 


1096 


oo A r^r^ f^r^f^ a a tt a oxoo at a 
GGACGGGGAA 1 1 AG 1 CGA 1 A 


A nn~7 
109/^ 


oo A/^ /^O AO A oo A TO To AT AO 

GGAGGGAGAGGA 1 G 1 GA 1 AG 




OOAOO AOO AO AOT/^ ATO^TT 
VjOAoOAoL/AOAL/ I oA I I 1 


1099 


GACCCAGCACATTAGCGAGA 


1100 


GCTCCTGCAATGTGGGGATA 


1101 


GCGCGTGAATTGTAGCACGT 


1102 


GCCACAGCArrGGAGAGAAT 


1103 


GGCAGGCTAATGGATAGTAA 


1104 


GCGCTGCGAATGAAAGACAT 


1105 


GCAGCGGGAATTAGATATAC 
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Seq. Id 


3' to 5' sequence 


1106 


GCAGGTGCAATGATTCTACC 


11.07 


GACCGGGCAATCACTTCAGA 


1108 


GCCGGGCAATGGGTTCATAT 


1109 


CGCAGGGCAAGCGATCAtAA ^ 


1110 


GCCACAGGCAGGGCATATTA 


1111 


GCCTAATCCTGGGAGAGTGA 


1112 


TCGTCTCGATCTAGGCCATG 


1113 


GTGTCTCGACTCAGCCTATA 


1114 


GACGTAGTAATCATGTCTCC 


1115 


GACTTATACGTCATGCGACC 


1116 


ACGATGTAACACAGCGAGCG 


1117 


AGTGGTGTAACCATGTGACA 


1118 


GTCGTGACAGTGATGTACTC 


1119 


GTGGAGTGACGtATCTCTAA 


1120 


TAGAGGTGAGGTAGTCGACT 


1121 


GTCGTGCGAGATAGCTCTTA 


1122 


GTGTAGAGATATAGCATCGG 


1123 


TAGTCGTdAGATAGCG ATTC 


1124 


CAGTGTGTACGAATACGAAG 


1125 


CGAGTGTCACATACCAGATA 


1126 


CGTATAGCAGACAGCGCAAT 


1127 


GACATCGACGACAGGCCATA 


1128 


CGAAGCTCACGTAAGTCAAG 


1129 


TAGTGCTCACGTAGGCGAGT 


1130 


TGCCCACGGTGAGCTAGTTT 


1131 


TAGCTGCGAGGAGGGTTCTA 


1132 


TGGGCCTAGGGTGTGCATTA 


1133 


TAGGGTACTGATGAGCACTG 


1134 


CTACGGGAAGGTTAGCACCA 


1135 


TGGTGATAGCTGTGGGGCTA 


1136 


GATTAGATACCACTGCCACA 


1137 


GGAGTGATACCTGGATGCAG 


1138 


AGGTGAGGAAATGTTCACAG 


1139 


GAGGAGATAATGGTCAGTAC 


1140 


CAGGGAATAATACATCCTCG 


1141 


ACAGCAACAAGTCGAGGCGT 


1142 


AGGGAGAGAAATCAGCCCTC 


1143 


C AAG AG ATAATACGGCTGCC 


1144 


CAAGTCCTAAGAGAGCTACG 


1145 


ATAAGCGCAAGACAGGCGTC 


1146 


ATCTGAGCACAACTAGGACG 


1147 


CACAGGCTAAGACAGGAGCT 


1148 


CATAGCGTAAGCCAAGCAGC 


1149 


CATAGTCTAAGCCACATGAG 


1150 


GACAGTACATGCCAATCAGC 


1151 


GCGGTAATCGGTGCATCAAA 


1152 


GGGAGTATAGCTGACCATCA 


1153 


GTAGiGCAGACCTGATCCCTT 
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Seq. Id 


3' to 5' sequence 


1154 


GAGCCAGACCACGCTTGATT 


1155 


GGCGCATCACTAGCCAGATT 


1156 


GGAGGTACATCCGCGAGTTT 


1157 


GGAGTCTACCCAGGGCATTT 


1158 


CGCGCTCTACACGATGGATA 


1159 


CGTGCCACACCTTGGAGTAT 


1160 


CGCGGCACACAGTTCAGTAT 


1161 


GCTCGTCCACAGTGCGTTAT 


1162 


GCTGAGGCAGAGTCCAGTTA 


1163 


CGGTAGGGACAATCAGCTTA 


1164 


ACGCACCGAAAGTGAGGGAT 


1165 


AGGTCGTCAAAGTGCAGACA 


1166 


ACGCAGTCAAAGTCATATCC 


1167 


GAGAGTCTAAGATCAGCAGG 


1168 


CACTGTCTAAGATAGACACG 


1169 


CAGCGTACAAGCTATACAGC 


1170 


CCGACGACAATGTACGACAG 


1171 


GACTAGCGAATCTAATGAGC 


1172 


CGTCGAGCAATATGAATGAC 


1173 


GTGTCGCGCAGTTCATAGGA 


1174 


GGGCG ACCAGG ATAG AG AAT 


1175 


ggcacacacgtctcggataa 


1176 


GGCAGACGAGGTTGCATACA 


1177 


CGTGGGACACAGTGGATCAT 


1178 


AGTGCGAGAACATCGTGTAA 


1179 


GGCAGCACAGCTTGTAGGAT 


1180 


GACCATTGAATATGTCGAGC 


1181* 


GtACGCATATTTAGCCAGCA 


1182 


GGCAATCTGTTCAGGACCAA 


1183 


GCTG AGTAATTGGTAG AG AG 


1184 


GGTGTGTAATTGTATGCACG 


1185 


GTTGAGACATTGTTAGGAGG 


1186 


TTAAGAGATTAGTCTGCCGC 


1187 


TCACGTAATTTGTTAGCCGC 


1188 


TGAGTGATAGCTCGGATCTC 


1189 


ATGATGATAACTACGTGCCC 


1190 


ATGCGAATAACTATGACGCC 


1191 


ATGGAGATAACTATGCAGCC 


1192 


TCGTTGCGACCTATGCGTAG 


1193 


TAGTTCGCACCTACTGCTAG 


1194 


ATACGTGCAACCACTGCTAA 


1195 


ATGTCGATAACGTCTGCTAC 


1196 


ATCTAGTCAACCTGAGCTAG 


1197 


AGTATAGCAAGCTCAAGTCG 


1198 


AAGACAGTAAACTCTGGTGG 


1199 


ACGATAATAACAGCTCCTCG 


1200 


ATAGATATAACTGAGGGGCC 


1201 


ACTGTAATAACCAAGCCTCG 
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oeq. 10 


o lo o sequence 




AO 1 oA 1 AoAALrUAOAoL/oUo 


IzUo 


AX/^r^/^r^ A/^ Ar^ Ar* AT A/^ A 

A 1 LjCjOUAUAUAUA I AOAoOo 


A on/1 


A^/^/^r*/^A^A A AXA*^/^ ATf^r^r* 
ALroUUVjAoAAA 1 AOvjA I L>ULr 


OAK 


UAOUUCjAUA I L/AA 1 o 1 AU 1 A 


■< one 


/^/^A/~»A/^T~A A Xy^ A A X/^ A TT^r^/^ 

OCjACjAo I AA 1 OAA 1 OA 1 OOCj 


1207 


CGAGCAATACAT AOA I OTGC 


1 208 


C AAC ATAGTTAC AO AOG CTG 


1209 


GAG CTT ATAG AG AC AO ACTC 


1210 


/^/~* ATA A A /^T A A A /~^/^T/^/^ 

GoATAGAAGTAGACAooTCG 


121 1 


CToAGAGAoATGAoAoToGA 


1212 


A T/^ A /^T/~* A A /^T A A T/^ A /^/^ 

ATCAGGToAAoTAAToAoCG 


1213 


A /^^/^/^ A /^T A A A T A /^/^T — T A ^ 

AG oG o AG TAAAT AG oTT AG C 


1214 


A /^T/^/^ A /^/"k A A A A T/^ A TT/^/^ 

ACTCC ACG AAAo ATG ATTG C 


1215 


/^T/^ A AT A T A A A /^/^ AT^^^^ 

CTC AATATAG AG ACG ATG Go 


1216 


/^/~^/^ A 1 1 A A A A A T/^ A 

CGCATTAGAGAGAGA 1 CGAG 


1217 


O A A T/^ A A T A A A 

OGOAOA 1 GAOA 1 AGAGOAOG 


H O-l Q " 

121o 


/^/^/^ A AXX A/^ A A/^ A/^ A /^/^/^ 

OGOAOA I 1 AG AO AG AG AG GO 


H 04 Q 

I2iy 


O I AG AO 1 AA 1 GO AG AG AGOG 


40on 


/^/^/^x AxAi^ AX/^r^ A/^ A/^ A~rr^r* 
GOG 1 A 1 AGA I GOAGAGA 1 OO 


4 OO-I 


X/^ A/^XA/^/^/^X/^/^ A AXA/^ A/^/^ 

1 OAO f AGOG I GGAA I AGAGO 


A ooo 
12z^ 


OAGAO 1 GAAO i OAA 1 G 1 AOO 


/| ooo 
122o 


^ A A X/^ A A ^X A O A X/^X A O 

OAOGA 1 GAAO 1 AGA 1 G 1 AOO 




r^r^ A A~rr* ax a a r^~r at/^ A/^r^r^r^ 
OGAA 1 GA 1 AAG 1 A I GAOGGO 


HOOK 


A/^ AX/^/^ A Ar^XAXA/^XA/^/^ 

OGAGA 1 GOAAG 1 A 1 AG 1 AOO 




<^ A X A O A A X A X A ir^ A 

GGA 1 AGOGAGA 1 A 1 AGAOOO 


1227 


ATA A A T/^ A /—^ A T/^ 

G GAT AG G AO G ATG G AC G ATO 


1228 


GTG ACAGG AC ATGOAATCG G 


A OOO 

1229 


T AX A AT^/^XT/^/"* AT/^ A /^/^/^ 

TA \ ACATGoTTCGA 1 OAOOG 


H oon 

1230 


A T A T/^ A A T A AV^T/^ A O 

AT ATG AAT AACTG GG AG G O O 


4 OO H 

1231 


AATA/^/^A A Ar^ A T/^ 

AATAGG AAAG ATG C G G G C O G 


>< OOO 

1232 


A/^A/^ATA/^A A A Ti^Ti^/^/^/^/^/^ 

AGAGATACAAATGTCGCGCG 


A OOO 

1233 


A />y^ A A T A A A A X/^T^^^/^/^/^ 

ACG AATAG AAATGTGGCCG C 


4 OO >1 

1234 


AO ATT A /^X A A A/^#^X/^i^/^ A/^/^ 

AOATTAOTAAAGG 1 GOGAOO 


A ooc 
i2oO 


A A T — r A /^T A A A T/^/^Xy^/~*0/^/^ 

AGA 1 1 AG 1 AAA 1 GO I GOGOO 


123d 


A/^TATOATA A/^A/^OA/^/^/^/^O 

AO I A 1 GA 1 AAOAGOAGOOOG 


123/ 


A X A X/^ A A X A A /^X/^/^ A 

A 1 A 1 GAA 1 AAO 1 OOAGOGOO 


A OQQ 

12po 


A A r^~rr^ A A AT/^x A Ar^ r^r^r^r^ 
AGAO 1 GAAA 1 O 1 AOAGOOOG 


4 OOO 


/^X A /^X/^ A T A A TX/^ r> A X/^/^ /^i^ 

G 1 AO 1 GA 1 AAT 1 GGA 1 OGOO 


4 o>m 

1240 


/^/^ A A A r^~r~r a a av^t 
OOAGAAOGG I 1 GOAGAOAO 1 


•1 O/l H 

1241 


GOAA 1 AG 1 1 GGAOOOAGGO 1 


lZ4^ 


ooAA 1 AoVjj ( oOAO 1 OAO i OA 


1243 


GCACAAGTTTCGCGCATCGA 


1244 


GCGGAATCTGTGCAGCATCT 


1245 


GCGAGAATATGGTGACATCT 


1246 


GCGGTCAATTAGTGGACTCC 


1247 


CTCCTACAATGGTGACACTG 


1248 


CTATTACAATGGTATGCCCG 


1249 


AATCATACAAAGTGTGCCGG 
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Seq. Id 


3' to 5' sequence 


1250 


CATGATCTAAGAGTGTAGCC 


1251 


CAAGAAGTAAGATGCGTGCC 


1252 


CATGTGATAAGATGTGGACC 


1253 


A A 1 ^' A ^\.^\ AAA y\ 1 1 . A ^% 

AAGTTAGCAAACTTAGCGCC 


1254 


TCTTCGATATGATAGGGTCG 


1255 


GACGTTAATTGATGAGAGGC 


1256 


GCGTGAAGTTGTTAGCAGAT 


1257 


GCCGATACATGCtGCACGAt 


1258 


CGCCGATTAAGGTGGGACAT 


1259 


CGTCATtTAAGTTAGGGCAC 


1260 


CTCGATCTAAGGTGCGATAC 


1261 


CG CTTATG AAG GTG G AG ACC 


1262 


GATGAGTGAATGTGAGTGAG 


1263 


CGCTAGTGAGAATTATGTGC 


1264 


G CTAGGTG AG AGTATG CTAT 


1265 


GGTGTGGTACGACGTTGAGA 


1266 


GGTAGAGTAGACCGATGCGA 


1267 


GTATATCGAGATCATAGGGG 


1268 


GTCTTGGACTATACGAGCGG 


1269 


TACTTGTAGATAGCGAGCGA 


1270 


GTACTCTGACATGATTCGCA 


1271 


TATAGTGACCTTATCGGCAC 


1272 


TCGTCTTGAGATATGTGGAC 


1273 


TGATGTTAGGGTATGCGAGA 


1274 


TCATCTGCACGTATCGTCAA 


1275 


GGGAGTGGAGAGATTGCATA 


1276 


CGGGCGCGAAGTATTGACAT 


1277 


GTGTGGGCAGGTATTCCATA 


1278 


TGCGGGCACGGTGTGATATA 


1279 


TGGGGGCTAGTGGCTCTTAA 


1280 


TGCGCCGCCAGTCTGTtATA 


1281 


TGGGCGTTAGAGTGTGCACT 


1282 


ATGGGCGCAACCCTGTCATA 


1283 


CAGCCCTGAAGACTGCGATA 


1284 


y^ y^ y~\ A A y^ y^ y^ A 'r y^ A ~f" a 

CGCCGCTCAAGGCTATGATA 


1285 


y^ ^r* y^ y^^^y^ A a y^ •r" a y^ ^^^^ a a 

CGCTCCTGAAGGGTAGTTAA 


1286 


y^ y^ y^ y^ a A .y*v y^ "TP a ■•^^^ A 

GGGGCGAGAGGTGGTATTAT 


1287 


GGATAGGCAGATGGACTTAT 


1288 


A yx A y^ A y^ y^ ^^^^ ^x a y^ y^ a y^ y^ •i^ A 

GGACAGACGTTGAGGAGCTA 


1289 


GTAGCGACATTGAGTTAGCA 






1291 


CTACACTAATTGCAGCAGCA 


1292 


CGTACCCGAATGCAGCAGAA 


1293 


GACGCCTAATGACGCTGAAA 


1294 


TAGCTTGTACTGCGACTGAC 


1295 


GATACTCTAATGCCATCGAC 


1296 


CGGCGTACAATGCCATAGAA 


1297 


CGGATACGAAGGCTATGCAA 
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1298 


ACGGATCGAAAGGTATAGCC 


1299 


ACGGCGCGAAAGCGTCATAA 


1300 


CGTGAGGGAATACGTCATCA 


1 301 


CACAGTGGAAGACGCATCAC 


1302 


GAGGTGACATGACGTACATC 


1303 


GAGTAGCGAATGCTCAGCCA 


1304 




1305 


CGTATGTCAAGGGCCTnATA 


1 306 


CGAGACGCAAGGGATTTACA 


1307 


GAGACGCAATGTGAATTAnG 


1308 


GATCGCACAGGAGCGTATCA 




TGCCCAGAGCGTATGAGnAA 


1310 


TGAGGGCGAGCTATCTATCA 


131 1 


TTGTGGCTAGGTATCGCTAC 


1312 


TGGTTAGCAGGTATGATCCT 


1313 


CTCACTGCAAGGATGGGACT 


1314 


TCCTGt AG ATCCCTATGCGG 


1315 


TCGTTGTCAGCATATTGAGC 


1316 


ATCATGTGAACCTATTGGCC ^ 


1317 


TACACTGGGACCTATGGGCA 


1318 


TACCTGGGAGCAtAGCTGAC 


1319 


TAGCCCGCAGCATAGGGTAT 


1320 


GAGCCTCAATGCTACGGAAG 


1321 


GATGTTCAATGCTGGCCGAA 


1322 


GAGTTGTGAATATCTGTGCC 


1323 


GCCGCCGAATTATTGAGCAA 


1324 


TG G ACTG ATTG AT AG G C A AC 


1325 


TGGCAGATCGGTGTATTGAA 


1326 


TATGCGTAATGGGTGTTrPA 


1327 


TTAGGTGGATTGATAGTGGn 




TCTGCTTTAGTGGGTAGrrA 


1329 


1 I GACGAG 1 1 1 GCAG I GCTC 


1330 


CTTGATTAAGTGCTGTACGC 


1331 


CTCGGATCAAGGCTTAGCGT 


1332 


CCGGGCTCAACGCTTTGTAA 


1333 


TGTGGCnC AGCTC ATGTGTT 


1334 


CTGGACCCACAGCTATGGAT 


1335 


CACGGGCCAAGAGATATACC 


1336 


CGCCCGCCAAGTGATGTATA 


1337 


CGCCAGCCACATGGATAGAT 


1338 


GGCCGGATACATGCGATTAG 


1339 


gctggcgtAcatccgtatga 


1340 


AGATGGCGAAATCCGTATAG 


1341 


GCAGGGACATTACGATCAGT 


1342 


AGCAGGTGAAATCGTACTAC 


1343 


GCAGGTCAATCTCTGTACGA 


1344 


GCATTGTAAGTTCGGTCAAG 


1345 


GCACTGGTAATTCAGCTACG 
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1346 


AG C ATC ATAACCC AAGOTGG 


1347 


ACCAGTCCAAAGCATAGTCG 


1348 


ATCA 1 1 1 CAACGCAGTGACC 


1349 


TCAGCCCTATCGCAGGATGT 


1350 


GTCAGCACCAGCCGTGATTA 


1351 


y-^ A A I 1 A y~v A y^ A y^ y^ ■ i. y^ a 

GAATTACGCACCCAGCTTGA 


1352 


y^ A A "V*^^ y»x y^ y^^^ A y^ y^ a ^\ y^ a A 

GAATGCGCCTACCAGCTATA 


1353 


GAATGGCGACAGCGTACATA 


1354 


GGATTGCCACGACTCACAAA 


1 355 


GCTCATTGACACTGCGCTAT 


1356 


GAGCATGGACCACGGCTATA 


1357 


CAAATGGAGAGACAGGGTGC 


1358 


CACTTTGAAGCACAATCACG 


1359 


GGTGTTGCAGGACGCATGTA 


1360 


TACCTGGCATGACGGGATAT 


1361 


TTCGTGGACTTGCGGATCTA 


1362 


1 1 y^ j^^ny^v y^ y^ A ~t" A y^ >^ ■ ■ ■ ■ 

TTCCTGCGATAGCGGCG 1 1 1 


1363 


i y^v A "l—y^^'y^ A ~r" A y^ y-% y^ y^ y^ ^^^T y^ 

TTGATCTGATAGCGGGTCTC 


1364 


TTGATGGCATAGCGTCTGAC 


1365 


TTCGAGGCATGTGGATCTCC 


1366 


TTCAGCGGCTAGGGGATTTC 


1367 


Ty^y^ A y^ a y~\"A "T"y^ y^ y'^ x^>~v A y^ 'i tt '1 

TCCAGCAGATCGGCGAGTTT 


1368 


TTCAGGCGATCTGCCGATAT 


1369 


TTCTATCGCATGTCAGCCGT 


1370 


^rJ^^r A A "^yv y^ y^T"y^ y^ y\ a y~»y^ y^ T" A 

TGTAATGCCTGCCAGCCGTA 


1371 


TAATTGCCTGCACAACTGGA 


1372 


TAATTCCATTGACGGCAGCG 


1373 


TTATTGCCATAGGGCGACGG 


1374 


AGAATTTCAAAGGCTGACCG 


1375 


A A y^ AAA a y^ "T" A y-^ — f— 

AGAGGCGGAAAGCAGTAGGT 


1376 


CGAATGCCAAGGCCAGCTAA 


1 377 


y^ A y^ y^ ^ y-v A A y^ y^ y— w-t~ y^ A /~V A 

GATGGTTCAATGGGTGGACA 


1378 


GTGGGCGAAGTTCTGAGAGA 


1379 


y^ -T* y^ y^ y-v a a t~ a y^ A ■¥ t" A a ~r" 

CGTGGGCAATACAGTTGAAT 


1380 


GAGCTGCGAATCGGTATTAA 


1381 


A y^ y^ y-\ y^ y— \ A A -*fy^ y~\ A y~\ y^ A ~T" A A 

GAGGGGGGAATGGAGCATAA 


1382 


GAGTTCGCAATCGGGACGTA 


1383 


A y> y^ y^ y^ A a "T"/^ ITy^ ^N^F A ~T" A 

GACGCGCCAATGGTGCTATA 


1384 


yi A "^r/^ AN y^T'y^ A A /^-r'y^ y^ ^^^T A A 

GATGGGTGAATCGTGCGTAA 


1385 


GATCACTGAATGCGACGTAA 


1386 


GATCGTGCAATGAGGTTACA 


1387 


GAGGAGTAATTGAGATGCAC 


1388 


GACCGATAATTCGATATGCC 


1389 


TAGCATTGATCCCATGTCAC 


1390 


TTCAGCTTATGCCAGTCGCG 


1391 


TGACGGCCTTGCATATCCGA 


1392 


GAAGGCGGCTTACATCAAGA 


1393 


GAATACCAGTTACACTCCAG 
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1394 


CAAGAACTGTTACACATCGC 


1395 


GACGAGAATGGACTACACGT 


1396 


TACAGACGCTTGCATAGATC 


1397 


TAACGACCTTAGCGACGGGT 


1398 


TAACGACGCTTtCGCAAGGA . 


1399 


TTAGCGCTGTTGAGCGCGTA 


1400 


TTCCATGTATCGAGCGTCAG 


1401 


TATACGCCCTTCAGATCGGG 


1402 


CTAAGCCTATGCAATATGGC 


1403 


CCAGCTATAAGCATATTGCG 


1404 


TACAGCATTGTCATGGACTC 


1405 


TAAGCTATTGGACATTGGGC 


1406 


TTAGCATCCTGTGATAGGGC 


1407 


tCTAGCAGCTTTCATAGCCA 


1408 


TCATCACGCTTTCCGAGGAT 


1409 


GCATACATTGGACGAGAGCT 


1410 


TCTAGCATTTAGCATGGTGC 


1411 


TTATGACTTGATCTGAGGCG 


1412 


TGTTCGCAGTGGCTTAGCTG 


1413 


GAGTTGAATGCAGATAGCTC 


1414 


TGCAGGCTCGCAGATGCTAT 


1415 


TGCGAGGACTGTAGGTTAAT 


1416 


TGGGCACTCTCGCCTAGTTT 


1417 


TGAAGCGCCTCGACTAGGTT 


1418 


TCATGGGCACTGATAGGTCA 


1419 


TCATGAGGCATGGAGCCAGT 


1420 


TAATCAGCGTTACGTCCGCA 


1421 


GAATGTGACGCAAGTCTGAC 


1422 


AGATTTGCACAGATAACGCG 


1423 


GATTACTGACCAGCATGGAG 


1424 


AACTATCGAAACCGCCAGGG 


1425 


ATAATACAAGAGTCGCGCCG 


1426 


ATAATCATAACCTCGACGCG 


1427 


ATTATCATACAAGGCAGGCG 


1428 


TATATCGGATCAGCAGGTGA 


1429 


TAATTTCGGTAGGCAGGGAG 


1430 


TAATCCTGTTACGCGGAGGC 


1431 


CTTTAGCTCCACGCAGTGTG 


1432 


TTCTAGCCGTGCGCAGTTTG 


1433 


GTCATGCGAGCAGCAGTCTT 


1434 


GGCGTTCGAGCAGTCATCTT 


1435 


TACCGCCAGTCAGCGAGTTA 


1 436 


TACGGCCTAGGAGCATTGGT 


1437 


TACGGCAGTGGATGTCAGGT 


1438 


TGTCTCGATGCAGGTGTAGT 


1439 


GCCGGATGACGAGGATATAC 


1440 


TACCGCGAGGCAGGATTCTT 


1441 


TACAGCAGTGCAGGGCGTTA 
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beq. Id 


o to 0 sec|uenc6 


4 A AO 


oO AoO 1 AoAVjOAoAo I A 1 OA 


144o 


VjjAOAvjUAoA I OAoAoAU 1 


4 A A A 

1444 


ta ao/^a/^/^t r~r a a ^t/^ aV^ 
1 AACjL/AOo I 1 1 AUAUL/ 1 (jAU 


1445 


X A i^r^r*r^^r^~rf>r^ a a t at 
1 AACOCjI \d I CSOACjA I 0(j(jA I 


A A AC 

1446 


TACT(jUoCjACOT(jjoA I G I AU 


1447 


TCAGGGCTACTCGATTG(3AA 


1448 


TCCGCAGACTTAGCGTTACG 


A A Ar\ 

1449 


TGAGCAGCCTACGTTACTAG 


A A C f\ 

1450 


TGCGTCAGATGCGTATATGC 


1451 


TCGTCCAGATGCGGAGTTCA 


A A iCO 

1452 


TGGGCTATATGCCAGATCGT 


A A izn 

1453 


AAGGACAAAGAGGGCGTCTC 


A A C A 

1454 


TAGCACCGATGGCGAGCTTA 


A A CC 

1455 


TA^Ty^/^ A rf^T^ ^ ^ A A T A T 

TGTCCACGGTGGCGCAATAT 


A A CC 

1456 


TGGTCCGAOTGCTGCTACTA 


A A IZ~t 

1457 


TGTGCCGACTGCCGTCTTAT 


A ACQ 

14oo 


I'Ti^/^/^ A /^T ATO/^ AT/^/^/^T at 

TTCGCAGTATGGATGGGTAT 


A A Cf\ 

l4oy 


XT A l^/^O a /^TT/**'/^ A T^ ^ A ^ ^T 

! 1 ACGOAG 1 r GGA 1 Go AGO 1 


A ACf\ 

14bU 


1 1 O 1 GA 1 1 AGO 1 GOGGAOGO 


A ACA 

14o1 


T/^/^TTAT A/^TTT/-*^/^ A/^ A/^/^ 

1 GG I 1 A 1 AO I 1 1 GOGAGAGO 


A A CO 

1462 


TTT/^ I'T A/^/^TT/^O/^^/^ A/^/^rf^ 

TTTGTTAGoTTCGGGoAGoo 


A A CO 


TT/^ T/^TO A T/^ O A T A 

1 1 GG 1 O 1 GA 1 OOGGGOA 1 AO 


A AC A 

14b4 


1 GO 1 1 GGAO 1 OOGGOGA 1 1 A 


A A 

1465 


oTGoTTGGAoCAGoOAGTTA 


A A 

1466 


A a /^T^ r^r^ K A a a a y^T 

AAGCTGGGAAACGCAoACCT 


A A 

1467 


A A /^/^/^/^ A A A /^/^ A T A T/^/^T 

AAGCGGGoAAAoGATATGoT 


1468 


AAATGCCGAAAoCATCTCGT 


A ACf\ 

1469 


CCATTCGGAAGCGAoTCGAT 


A A ~7f\ 

1470 


TAoATGGGoTGAGAACGCAA 


A A~TA 


TATTGGGCAGGAGoGCCTAT 


1472 


oATGGGGGAAGAGTAGoACA 


A A 

1473 


ATTTC ATGG AG ATAGC AGG C 


A A^ A 

1474 


A T~r^/^ A A A A /^/^/^ A A /^T 

ATTGCAGCACAAGGCAGAGT 


A A "7 IT 

1475 


TTGCTAGGCTCAGTOOCGAT 


A A "7C 

1476 


T 1 GGOGAGO 1 GOG M O 1 OA 1 


A A "7*7 

1477 


TOOOAGAGA 1 GOGAO IGO I A 


A A ~7Q 

1478 


1 1 r^r^'r^TT^ at/^/^/^o AT/^ti^t 

TTCGCTGGATCGGCATGTCT 


A A ~Tf\ 

1479 


TT GCTCCTAGCTCGCGTGAT 


A A ork 

1480 


TTGCTGCTAGTCCAG 1 AGGC 


A AO A 

1481 


^ A TT A A A T/^O A a A /^/^ 

OA 1 I AAGOAG i OGAGAGAOO 


14o^ 


OVj 1 1 AA 1 oOAv^OoAoAA 1 OA 


1483 


CGCAAGCTCAGCAGAATTAC 


1484 


CCATGTCGAAGCATTCATAC 


1485 


CTGAATGTAATCATCGTGCG 


1486 


CTTAGATGAATCACTGCCAC 


1487 


CTTCACGGAATCTAGGCACA 


1488 


CACTCTTGAAGCTAAGCACA 


1489 


CCTCTAAGCATGTTGACACA 
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1490 


CATGCCGGAAGATGCGTACA 


1491 


CAGGCAGCAAGATGTACGAC 


1492 


CAGTGGGCAAGATAAGATTC 


1493 


CCGTGCCCAAGCTAGTGATA 


1494 


GATCGGGCAATCTGCGTACT 


1495 


TTCAGTGCATTATAGTGCGG 


1496 


TTATCTGCATGAGTAGGTCG 


1497 


TCGATAATCTTTGTAGCGCG 


1498 


TCTTACAGCTTTGCAGGGAG 


1499 


TCCTACATTTGCCACGGGAG 


1500 


TCTTCATCAGTGAGGCGCGA 


1501 


TTTCTAGGATGTATGCGAGC 


1502 


TATCCAGCATTACTGCGAGA 


1503 


TTATTCTCAGCACGCACGGA 


1504 


TGATTCGCACTCGCGGCTAA 


1505 


1 1 lUIAIUAUIUCiUIUUCiAA 


1506 


TTCGGATCAGTGGATGGAAA 


1507 


GATCGTCAATCTGATGCAGC 


1 508 


AGATCGCTAAATGAGGACCC 


1509 


GATGCTATAATGGTATGGCC 


1510 


A A .^^^^ AAA ^^^^ A A 

AGGAGCGTAAATTATCAGCC 


1511 


GGGCGATGACTATATCTGAA 


1512 


CTGGATTGACACTAGCATAC 


1513 


CTGCGGATACCATAGAGAAG 


1514 


ACTGCAATAACATATCCGCG 


1515 


AATGACATAAAGTGCTGCCC 


1516 


ACATGGAGAAAGTAGTCCGC 


1517 


ACAGGCGAACAATGTACCCG 


1518 


ACCAGCACAAAGTCTACTGT 


1519 


AGAGAGCCAAATGACTGTCC 


1520 


TAGTGCATAATTGCTTGCCC 


1521 


TGAGCATATAGTATTCGGGC 


1522 


TGAGCGTTAGAGCTTGATCC 


1523 


TAGGCGCTAGGACTCGTTAT 


1524 


TATGGCCGACGATGtGtCAC 


1525 


TATGGCTGACGTAGCGCACt 


1526 


TCTCGGTTACTGAGTGGACT 


1527 


ATAACGGGACAGAAGCTGCt 


1528 


ATAGAACTCAATAGCCGCTC 


1529 


CATAATACACATAGGCTGCG 


1530 


CAGTACGCAAGCAGATAGCC 


1531 


CAGACGCGAAGATAAGTTCC 


1532 


CAGCCAAGATAGCATACTCG 


1533 


TCCCATAGATAGCTCGCTGG 


1534 


TTCGCATGAGTGCTGAGTAG 


1535 


TTCCATATACTGGTCGGCAG 


1536 


■nTATGATATGCGTCGCGGA 


1537 


TTTCrrATATGCGCGAGCGG 
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1538 


TGTTGCATATTAGCGGCTCG 


1539 


ilATATGACATCTCTTGCCCG 


1540 


TTGTCAGATTTGCGCTCCGA 


1541 


GCATGCGAATTGCGAeGACT 


1542 


GGATCTGAATTGCGCGACCA 


1543 


GGCTATGAATTTGGCATCAC 


1544 


GGATATGCAATTTGTAGCCC 


1545 


CAGCGTATAGCAAGATGGAT ' 


1546 


CGAGGGATAATCAAGTCGAG 


1547 


CGCGGATGACACATACTCAG 


1548 


CGACGAGCACCAAtTCGAGA 


1549 


CCGTAGTGACCAATGCAG AC 


1550 


GCGATATACATCATTCGGAG 


1551 


GACAGTCTAATGACTCGTAC 


1552 


GGAGTTATACTAAGGTGTGC 


1 553 


GGAGTAGTAATGAGTGTCAC 


1554 


GCAATGTAGTCGAAGTGTCT 


1555 


GCATATAGATACCATTCGCG 


1556 


CGAATACTAGACACATTGCG 


1557 


GAAGTAGAGTACACAGCGTG . 


1558 


AGACACAGAACTACCGCGTG 


1559 


AT AGCAGAACGTAGACGCCG 


1560 


ATACAGTGAACTACATCGCG 


1561 


AGTACAAGCTAGAATCCGGC 


1562 


GAAGACTACTAGATACGCGC 


1563 


CGATAATACTAGAGACTCCG 


1564 


GGGTGGGTACACATAGATCA 


1565 


GGTGAGGGACACATGATCCT 


1566 


GTGTAGTGACATATAGAGCG 


1567 


ATGTCGTCACAGAGAATACG 


1568 


ATGCTAGGAACTACCAATCG 


1569 


ATGATAACGTACACAGGTGG 


1570 


TCGGTGTACGTCTGCTCAGT 


1571 


GGCTCACGATCCACTGGTTA 


1572 


TGCCTGATACCTTGGATGAC 


1573 


GGCCGTGAATTATCATAGAG 


1574 


GGCTTGGACGCATTGATAAC 


1575 


CCCAtCGAAGCATGTGTAAA 


1576 


CGGCATCGAAGGCGTTCATA 


1577 


GCCAGTTGACCACTTCTGAG 


1578 


TCGCATTAGCCATGTGGAGC 


1579 


GCAATCTAGTCTAATGGCGC 


1580 


CTAAGATGTTCTAATCGCCC 


1581 


CCAATAGTAAGTAATGGGCC 


1582 


TCATTATACTCTGAIGGCCC 


1583 


ATGCTAATAAGTGATCGCCC 


1584 


AGTGTCAACCATGATGAACC 


15.85 


AGAGCATAACATCATGGCGC 
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1586 


AGAATCTAACAGCGATGCCG 


1587 


ATTTAGACAAGTCGATGGCC 


1588 


ATATTAAGAAGTAGGGGGGC 


1589 


CATATCAGAATACGATGGCC 


1 590 


GATATACAGGATTATGGCGC 


1591 


CATAAATTGGTTGACACCGC 


1592 


GAAACTCCAATTGAGCGGAC 


1593 


GAACAATGAATTTAGCGGCC 


1594 


TTCCATTAGATGTGATGCCC 


1595 


TATGATATCATCTGAGGCCC 


1596 


ATCAGAAGAACTGCACGTCC 


1597 


AGCACAAGAACTACGCGCTG 


1598 


AGCAAAGAACCATGCCGCGT 


1599 


TAAAGAGCAATGTGGCGTAC 


1600 


TTCAGGGCATTGAGCGTAAA 


1601 


TTAATGGGCTTGAGCGTATC 


1602 


TTAATGCGGTTGAGATGGAC 


1603 


^\ A ^\ A A AN A% A A^ A A A% A ^P,/^ 

GGAGGGATAGCAGATAGATC 


1604 


TCAGGAGAGGCATCGCATCA 


1605 


TTATCTTAGGGATGCGGATC 


1606 


TGTGCTCTAGGTCATCCGAG 


1607 


1 1 ' ANT~ A "T"ANT A ANT" AN AN AN A AN ANA** A A 

TTGTATCT AGTG CG AG G C AA 


1608 


TATTATGTAGTATGCGCGGC 


1609 


T" A /"N" 1 1 A -^^N A A ANTfAN A AN^TAN AN 

TAGTTATCAGAGTGACTGCG 


1610 


i ^ A ^N A "r"^N A A ^N^*AN A AN ^N ,^N ^N 

GTTAGATCATAGTCACCGCG 


1611 


GTTAGTATAGATTGGCCGAC 


1612 


GTGTTTATACGTTGAGCACG 


1613 


TTATCTGTAGTCATGGAGGC 


1614 


TGATACTGAGTTAGGGAGCT 


1615 


GTGATCTCAGAGGGGAGGTT 


1616 


GAG ATGTCAAGACGCGGACT 


1617 


GTG GTG AG AG AG CG G AATCT 


1618 


CGTG G GAG AG AG CT AG AT AT 


1619 


GTGCCGAGACTCCACTGTTA 


1620 


GGGGAGAGGTCTGGTAGTAT 


1621 


A "^y^ A A A y^ -r- A "1"/^ A A <^ <^ y^~r~AN 

ATGGAGAACTATCAAGCGTG 


1622 


y^ ' ■ ' I A y^ a y^ y^ y^ y^ a y^ y*^ y^ * 

GTGCTTTAGTAGCGGAGCGA 


1623 


-f— A A A ~¥~ A ^^y^ AN~T* A ~f A y^ AN AN y^ AN AN 

TAAATATGGTATAGGCGGGG 


1624 


A A -i 1 ■ AN*"!" A AN^n A ~r A AN y'N yN y~* y-N y^ y^ 

TAATTCTAGTATACGCGGGC 


1625 


■T" AAA -T" y^ y*N^r A y^ T" A y^ A y**^ y^ y"^ ^> 

TAAATCGTATGTAGGAGCGG 






1627 


TCAGTTAtATGAGCCGACTC 


1628 


TCACGTATATTGACTCCGAC 


1629 


TCACCGTATTCGAGGCGACA 


1630 


TCGTACTGATTGACGGTGAT 


1631 


TCACAGCGGTCGAGGTTACT 


1632 


TTCACGCGGTCGCAGTATCT 


1633 


TACTTGACGTGACTGCATCG 
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1634 


CGTCACAGAGGACAGCATAC 


1635 


TCACTAGAGCGTCGAGCTGT 


1636 


TGTACAGTGTGTCAGAGTGA 


1637 


CTACCTAATCGACAGCAGAG 


1638 


caccgataagtacagcaggg 


1639 


CAACGTCTAGGAGAAGGCAG 


1640 


CACTAGCTCAG ACAG ACGAG 


1641 


GACTTTACAGTACGATCAGC 


1642 


gacactgactgacatcgaga 


1643 


GAGACAGTCGAGCGATCAAT 


1644 


GCACTTGTACGTCCAGTCAG 


1645 


GTACACGGACTGCCAGCATA 


1646 


GTAATAGGCTATGAGCAGAC 


1647 


CTAGATAGACATCACTCACG 


1648 


TAGAGTGTGGATGAGGGGTA 


1649 


GACTTGGAGGTACAGGCGAA 


1650 


CTTATGGGACACTAGCTCGA 


1651 


CTG ATG CTAG ACTAGG C AC a 


1652 


GCAGACGCACTATCATATAC 


1653 


GCAGTAGAGAGTTCTGAGGA 


1654 


gcaggtacactgaccgacta ; 


1655 


GCAGATCACTGCACGATAGA 


1656 


GCAATGACTTCGACTGCAGA 


1657 


GACAAGTCATTTACAGGGGA 


1658 


GTAACTTGTTTGACAGTGCG 


1659 


GACACTGCATGGACAGCGTA 


1660 


GCAAGGAGTGAGACATGGTT 


1661 


TGCGAGGTAGGTTATATCTG 


1662 


tggggagagtgatatacttg 


1663 


GGCGTGAGAGGATTATATCT 


1664 


GTGGTGCGAGAGTATTATCT 


1665 


CCGCGTGTACGATATAATAC 


1666 


GAGGGTGGAGGATATAGACT 


1667 


GGCGGTGTACGATTATGACT 


1668 


GTAGGTTGACGATGCTGACT 


1669 


GTGGTGGTAGTAGCTGGTCT 


1670 


TAATGTGACGTAGCCGAGTG 


1671 


TACGGAGTGGGAGATGGTGA 


1672 


TAGGGATGTCGATAGATCCA 


1673 


TCTGGTATAGGATGAGGAAG 


1674 


TCGTGAGTAGGATGGTTTGA 


1675 


tacgtgagatgatgatggct 


1676 


TAGTGGGTAGGATGAGTGTA 


1677 


TAGTTGGAGGAGTAGTGATG 


1678 


TAGGTACAGTGGTGGATAGT 


1679 


ctgggtcaagtgtgtagaat 


1680 


TGTGGGGTAGAGTGTGTGCT 


1681 


GGTGGGTGACGATGTGGTAT 
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Qpn IH 


to com lonoo 


1 DO^ 


O 1 O 1 1 MV^ 1 M 1 OwVw^M 1 \^r\ 


1 ooo 


OV-f 1 OM 1 O 1 MV^ 1 /A 1 L/V^M 1 


1 D0*+ 


PT A AXn A P ATP A (Ti A P 
OL/ 1 MOM 1 OML/OM 1 L>MOO 1 ML/ 


1 DOO 


r^PATPTr^TAPr^ATPTP APPA 
OOM 1 L/ 1 O 1 MV^OM 1 V_/ 1 OMOL/M 


1 QOO 


riPATPAPr^APPATTATPAPA 
OOM 1 V^M^b-fOMOOM 1 1 M 1 OMOM 


1 f\fl7 
\ OOl 


f^PTAPf^TTAPP ATr^Tr^PAf^A 
OO 1 MOO 1 1 ML/OM 1 O ' oOMOM 


1 Quo 


P /^T A TT A P P A T*^ PTP A P A 
OV^O 1 MO 1 1 MOOA 1 OO 1 OML/M 


1 ooy 


PPPTPAPPAPAPTpTATPAfi 
OOO 1 OMOOMOMO 1 O 1 M 1 OMO 


loyu 


PPPTPPPA ATTAT/^TATPAP 
OL/O 1 OOOMM 1 1 M 1 O 1 M 1 OMO 


'i fiO'l 


TPTPP A PAPXXPXX ATA PPP 
1 o 1 ooMOML/ 1 1 O 1 1 M I MooO 


1 oyz 


PPPTP APT A A TTTP APT A PP 
OOO 1 OMO 1 AM 1 1 1 oAO 1 MOO 


loyo 


APPTPPPT AP A A AXr^PTATr^ 

MoO 1 OOo 1 AOAAA 1 OO 1 A 1 O 




PPPAPPPP A APT APPPT AT A 
UOOAOOOOMMO 1 AOoO 1 A 1 A 


ioyo 


PPt APTfSPTA AP;PAPPPTAPA 
OOAO 1 OO 1 AAOOAOOO 1 AOA 


ioyo 


PPATPTTP APAPAPPPP APT 
OOA 1 O 1 1 OAOAOAOOOOAO 1 


loy / 


PATPTAPA APTP APTPAP AP 
OA i O 1 AOAAO 1 O AO 1 OAOAO 


loyo 


PPAPPPPTAAPPATPTPAP A 
OOAOOOO 1 AAOOA ' V> 1 OAv-»A 


ipyx7 


APAPf^PTriAAAPArSTAPf^PP 
AOAOOO 1 OAAAOAo 1 AOOOO 


I7nn 


PATPTPAPAPPTAPPPATAP 
OA 1 O 1 OAOAO O 1 AOOOA 1 AO 


1701 


TPTPPTPPAPPTAPPTPTPA 
1 O 1 OO 1 OOAOO 1 AOO 1 O 1 OA 


17h9 


PPTPPPAPAPATPPPTATPA 
OO 1 ooo AO AO A 1 OOO 1 A 1 OA 


1 70*^ 


PPPPPTATAPPTPPATATPP 
OOOOO 1 A 1 AOO 1 OOA 1 A t OO 


1 70/1 


PPTP ATAPAPTTPPATAP AP 
OO 1 OA 1 AOAO 1 1 OOA 1 AOAO 


1 70*^ 


PPTPAPTAAPPTPPATAPAP 
OO 1 OAO 1 AAOO 1 OOA 1 AoAO 


1 7nfi 

1 f UD 


PPTP ATT APPATPT APT APP 
OO 1 OA 1 1 AOOA 1 O 1 AO 1 AOO 


1 707 


P A ATPPTP APP APP APTAPP 
OAA 1 OO 1 OAOOAOOAO 1 AOO 


1 70Q 


P A APTPTPPTP APP A ATP AP 
oAAO 1 o 1 OO 1 oAOoAA 1 oAo 


1 700 

1 f uy 


TT A PTPTPT ATPPP ATPPP A 
1 1 AO 1 o 1 O 1 A 1 oOoA 1 OOoA 


1710 
1 f 1 U 


PTTATPTP ATPPP APATTPP 
O 1 1 A 1 O 1 OA 1 OOOAoA 1 1 OO 


1711 
I'll 


APPTATATP A APPA APPPTP 
AOO 1 A I A 1 OAAOOAAoOo 1 O 


1710 
1 r 1 Z 


PPTTATAP APTPPAPTAP AP 
oO 1 1 A 1 AOAo 1 oOAo 1 AoAo 


1 71 
1 r 1 0 


TTAAPTAPPTAPPTPPPpTP 
1 1 AAO 1 AOO 1 AoO 1 OOOO 1 O 


171/1 


PAAP APTAAPTPPA APPPPP 
OAAO AO 1 AAO 1 OO AAOO OOO 


17m 

i / 1 D 


PAPTAAPAPATPPAPAPPPP 
OAO 1 AAO AO A 1 OOAO AO OOO 


i71R 
1 r 1 D 


pptaptppapappapatpat 

OO 1 AO 1 O OMO MOO MOM 1 OM 1 


1717 


TP ATPPAPPTPPPP ATA PPT 
1 OM 1 OOMOO 1 OOOOM 1 MOO 1 


171ft 
1 f 1 O 


TPTATAPPPTPPTPPAAPPA 
1 O 1 M 1 MOOO 1 OO 1 OOMMOOM 


171Q 

1 / 1 y 


tpaapppppappppapttta 

1 OMAOOOOOAOOOOAO 1 1 1 A 


1790 


TP APPPPP APP ATTPATPPT 
1 OAOOOOOAOOA 1 1 OA 1 OO 1 


1791 


PP ATPPPP APP A APTPP AT A 
OOA 1 OOOOAOOAAO 1 OOA 1 A 


1 799 


PAATPPPPAPPAATPATPPT 

OMM 1 OOOOMOOMM 1 OM 1 OO i 


1723 


GAATCGGCAGCAATACTGTC 


1724 


GAAGCCCAGCTAAGTGGTAT 


1725 


AAGAGGCCAAACCGGATGGT 


1726 


TAAGCACCTTGCAGGATAGA 


1727 


TCAGCCCGATGCAGGGTATT 


1728 


TATGCGCCCAGGAGGCTTTA 


1729 


TGCCCAGCAGGTGGGATTAT 
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o6C]; 10 


trt tx' iQn^A 
O lO D S6C|U6nC6 


i TQH 
I 1 oU 


1 Moo 1 OOUM 1 LrMU 1 oMV^vJoM 


■1 TQ'I 


PPTPPP AT APP APTPPP ATA 


1 / Oii 


APT A APPPAAPAPPPPAPPT 
AO 1 AAOOUAAUAvjOooAoVj 1 


1 / OO 


P APPTPT A A PP APP AP APP A 
UAoU 1 O 1 AAoOAoOAOAovjA 


1 / o4 


AP/^T/^ AAPPAPATAPPAPT 

OAotj 1 OAAoOAUA 1 AUOAo 1 


1 / oo 


PTP*TP*PA ATPAP^P^PP AP' AP A 

O 1 (a 1 oUAA 1 LtAOoUOAoAVjA 


17ob 


PPP^^i^ A ATA ATPTP AP AP* A 


1737 


P P P A A T A A "TtP A P A A P ~r 


17oo 


A/~^/^/^/^/^A/^A/^A ATA/^A^/^/^T 

AGGGCCAbACAA 1 AuACOCj 1 


1739 


GAGG r GAOAA I 1 1 Gu 1 AG AG 


1740 


CAGGCACAAGATTGAGCAGG 


1 741 


A/^A Ar^f^r^r^ A a ataptp/^/^pp 
AGAAGGGGAAA 1 AG 1 GGGGG 


1742 


AOAATO TG AAA 1 AG GGGGG G 


<i Til o 

1743 


ATCGACCCAAGAATAGCTCG 


1744 


ATA A ^ A V*^ AAP/^A/^/^/^/^/^/^T 

A 1 AAGGAGAAGGAGGGGGG 1 


-1 -7/1 jr 

1 /4d 


A A P A OT^/^ A A A/^PP A PPPTP 

AAGAG 1 GGAAAGGGAGGG 1 G 


1 ^4p 


A ATPTATP A A APPP APP*PPP 

AA I G 1 A 1 GAAAGGGAUGoGG 


1 #4/ 


ATTPP'P AT A A PP* PP P A P P* AP 
Al 1 OGGA 1 AAGGUooALpoALr 


1 / 4o 


ATPPP APP A APPPPPTAP A A 
A 1 oOOAoUAAOoL/oL/ 1 AoAA 


1 /4y 


ATP^PTPAPA APPPAP/^ AP* AP 

A 1 vaO 1 UALfAAGGGAOoAoAvj 


1 /OU 


ATP'PTPP A APP ATAP ATAPP* 

A 1 oG 1 GGAAOGA 1 AGA 1 AGG 


1 A Ol 


P APPTTP A AP A PT AP ATAPP 
L/AoO 1 1 OAAv^Avj 1 AO A 1 AOo 




ATP TP AP A APPPP AT APAP 
OA 1 o 1 OAOAAoooOA 1 AoAO 


^ 7CQ 


P ATPP'TPTA APPPPTAP AP^ A 

GA 1 KjKd 1 G 1 AAvjGGO I AOAoA 


'i 70I 

1 /^o4 


AP ATP»P*PP A A AP*P APP APP'T 

AGA 1 GGGGAAAGGAGGAGG i 


1755 


GT \ AGTTGAA 1 GGAGGGAGG 


1 / OD 


PP*P/^ A/^TTA ATP*PAPP* AP AP 

GGGGAG 1 1 AA I G GAG GAG AG 


A 7C7 

1757 


OAP/^A/^^A A/^T/^P^ APT AP AP 

GAGCAGCAACTGGAG FAG AG 


-1 7CO 
1 (DO 


/^PP A APTP A AOTPPPPTAP A 

GGGAAG 1 GAAG 1 GGGG 1 AGA 


A 7cn 

1 /oy 


/^P A/^TPTP AATA APAP* APP^T 

GGAG 1 G 1 GAA I AAGAG AGG 1 


17oU 


r^r^Af^f^r^/^A a/^t^ atpp*ta a a 
GGAG GGGAAG 1 GA i GG i AAA 


A TCA 

1761 


r^^T/^/^TAP a ATP a OTA op a A 

GG 1 GGl AGAA 1 GAG 1 AGGAA 


A 7CO 
1 fOZ 


PTAOTOPPA ATP ATP AO AP A 

G 1 AG 1 GGGAA 1 GA 1 GAGAGA 


A 7C5 


P A ATOPO A AOTPAPTAO aPP* 

GAA 1 GGGAAG 1 GAG 1 AGAGG 


A "7C A 


O A TOOOOT A r^r^ A ATAP/^TAP 

GA 1 GGGG 1 AGGAA I AGG 1 Avjj 


1 /DO 


A A OTP PPPP A A A T A A PTPPP 

AAG 1 GGGGOAAA 1 AAO 1 oOO 


1 f DO 


P A APOOPP A ATA PAP A/^/^r^T 
UAAGGGGGAA 1 AUAOAooo 1 


H 7C7 


O ATPTOPP A ATOPOOOO ATA 

GA 1 G 1 GGGAA 1 GGGoGGA 1 A 


'i 7CQ 
1 fOO 


O ATTOOPP A ATPTPPPPT A A 

GA 1 I GGGGAA 1 G 1 GGGG 1 AA 


A 7CQ 

1 /by 


P APPPPPP A ATPTP APTAP A 

L?AvaGGGGLrAA 1 o 1 oAO 1 AoA 


A 77C\ 
\ f f\J 


/^PPPPP/^Pi A AT/^ TP r^TATAT 


1771 


GCCGCGCCAATGTTACGTTA 


1772 


CTTCGCCCAATGCGTAGGAA 


1773 


TTCCCATGATCGCTGACGAG 


1774 


TTGCGGGAGCTGCCTCTTAA 


1775 


TTTCCCGGATAGCCGCTGTA 


1776 


TTTGCTGGAGTATGCGCTCA 


1777 


TTGTTCTCAGCTTGCGGCAG 
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becj. la 


o to D sequence 


1 ( fO 




■i 7 7 n 


lull (jCjVj I AtjOA 1 U 1 o 1 OAU 


1 fou 


I vjCjvj 1 (j 1 UAuOA 1 O 1 ALrtjOA 


17oi 


T 1 CjTGGOAvjC? I A 1 1 CoAA 


A "TOO 

1782 


GTT GGGCACGGATOTOTA T A 


1783 


*^/^/^/^ A /^/^/^ A /^/^ A "T/^/^T — TATA 

G C C G AG G C AC C ATG C TTATA 


1784 


CGCTTGGGACAATCGCGTAT 


1785 


CCGCAGGGAACTTCAGCATA 


1786 


TG G AG G G C AGTCTGTC AT AA 


1787 


CTGGGTGCAAGTTGTATCAA 


1788 


TGGCGCACATGGTGTCATAA 


1789 


TGGGATCACTGGTGCGGAAT 


1790 


TGCCAGTCATCCTAGCGTGT 


1791 


TG AGGCC AG GAGTG CTTATC 


1792 


TTGGGATAGGAGTGCTTCTA 


1793 


1 1 1 GCAGACGGTGTGCTATA 


1794 


TTGAGTCAGGGTGCOCAACT 


1795 


1 1 1 AATA 1 GGTTGGGGvjACjG 


1 yb 


1 GAGGA 1 GA 1 GAGGA 1 G 1 AG 


1797 


r^T/^A A/^^~rr^/^r^ A/^ A Ar^ Ar^T A 
G I GAAGO 1 GGGAGAAOAG I A 


1798 


1 GAGAAG 1 oGG 1 GGA 1 GA 1 A 


1 /^yy 


1 G 1 GAGA 1 GGG 1 GGAGGA 1 1 


1oUU 


G 1 AG 1 GAG AG 1 GAGGAGGGA 


A or\A 

1801 


TGGTAGGGAGTGTGGAGGTT 


1802 


TACGTGTGACTATGGTCGAG 


1803 


TATAGTTAGGTGTGGGACGG 


1804 


TACCGTTACGTCGCTCAGAG 


1805 


CACTAGAAGGTGGTACAGAG 


1806 


AT AG GTATAAC G GAG TACG G 


1807 


A "T A A /^T A A /^/^ A T" A /^Ti^^ 

ATAGCAGTAACGCATAGTCG 


1808 


ATAATCGTAACGCAGGGAGG 


1809 


ATGAGTGTAAGGCCTCGACA 


A O A r\ 

1810 


ATGTAGGGAACGTACTGAGA 


4 OA 4 

181 1 


ATCTAGCGAACGGAACTATG 


1812 


G 1 AG AG 1 GAGGA 1 GGAG 1 AG 


1813 


GTAGTATGACGTAGCAGTAC 


A OA A 

1o14 


G 1 AGG 1 GGAGG 1 AoA 1 GGG 1 


A OA C 

1815 


G AGTGTGT AG G AG GT ATG AT 


A OA ^ 

181o 


^ /^T^T/^TT A A A /^T A A T 

Go IGTCTTACAGGAGTACA \ 


A OA^ 

1817 


/^/^T/^/^/^i^T A /"^ A /^/^ A /^T/^ ATT 

GG TGGGG 1 ACAGGAG 1 GA 1 i 


lolo 


o 1 AoOU 1 AoAL/oOAVjj I Oo 1 A 


1819 


CGTCTCGCAAGTCGCGTATA 


1820 


AGTCGCGCACAGCAACGTAT 


1821 


ATCGAGGTAACGCCATATAC 


1822 


CTCGTGACATAGCCATAGAT 


1823 


ATGCGACGAAGGCGGATATA 


1824 


GTAGACAGACTGCGACATAC 


1825 


TAGTCGTAGAGGCGCTATCA 
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oeq. 10 


O lO 0 S6C|U6nC6 


1 OZD 




1 OZ / 


r'T^ir'/^T ATAr* ao atp a atpp 

L» 1 oUo 1 A I AoAoA 1 L/AA 1 


^ Q O Q 


UUUUo t A 1 Avj AUAoA 1 A 1 oA 


Jozy 


r^Tt^r^ r^~i — r a/^oa/^a/^a r^Tf~^r^ a 
U 1 UUO I 1 AOUAOAoAU 1 UoA 


looU 


UoUoOAOoAIjAUA 1 AVijU 1 1 A 


Ippi 


A/^/^OT/^ A/^A/^A/^AAOA r^T/^f^ 


lopz 


OO 1 AUoAVjALtAUA 1 oAOACjjo 


looo 


CGCCGAGTACACATCjCAGAT 


1oo4 


ATA A A /^T/^ a A T" 

CCGTCG ATACAGACT C AG AT 


looo 


O 1 L/G 1 UAGAOAGAGUGGA 1 1 


A OOO 

looo 


G 1 U 1 OGOOAOG 1 A 1 OGGA 1 1 


'I 0 0~7 


1 G 1 GGGG 1 AG 1 1 AGGGA 1 GA 


looo 


GTCT CGGTACGATGTAGCAA 


looy 


CGTGTGAGACAGTAGCATAT 


1o4U 


GG 1 G 1 AGGAGAGGGAGGA i 1 


A QA A 

1 o41 


G 1 G 1 AGG 1 GAG 1 GAGGA 1 GA 


AQAO 

1o4z 


A/^^T A/^ AXA A/^^^OT A/^ AT/^/^ 

AGG 1 AG A 1 AAGGG 1 AG A 1 GG 




ir*xr* T A o A a A Tr'Tns A A Tr* 

U 1 G 1 AVjAGAoA I U I \jAA I GG 


I044 


C'TC* AT Ar*r* A Ar!lTr*TT AT/^r^/^ 
U 1 GA 1 AOGAAV3 1 A 1 


•1 QA R 


r» AV^r^ f^Tr*r^ A A A r*T A A A 

GAGGG 1 GGAAGAG 1 AA 1 GAG 




A <^ o A T A A <^ A p T A T A r* 

UALfOV-rV^A 1 AAVjAOo 1 A 1 AVjO 


1 0'l- / 


p T A p A T A A T p T A T r:i P A P 

o 1 AoOAvj 1 AAo 1 U 1 A 1 VjL/AO 


AQAQ 

1o4o 


r*/^XAPTT/^ A APTP ATPO Ai^ A 

GG 1 AG 1 1 GAAG 1 UA 1 GGAGA 


A QACl 

io4y 


/^/^^P AT" A/^ A APTP APr^ A P AX 

GGGGA 1 AGAAG 1 GAG GAGA 1 


1obU 


^ APPP A PP aP AXr>XP APP AX 

GAGGGAGGAGA i G 1 GAGGA 1 


A QUIA 

Tool 


GA 1 AGAGGAA 1 AGAGGGGGG 


1o02 


A x/^ A ^ A <^x A r*~r a o a 
GA 1 GAGGAGG 1 AG 1 AG GAG G 


looo 


A #^ A X A T A A PP A A PTPXO/^^P 

AGA 1 A 1 AAGGAAG 1 G 1 GGGG 


A OCA 

1oo4 


<^AT — r A X A A /^X A PX/^ A /^P/^/^ 

GA 1 1 A 1 AGAG 1 AG 1 GAGGGG 


A QCPi 

looo 


/"■^ A XXX A~rA/"»XA/^A/^ I /"^/"^ 

GAG M 1 A 1 AG 1 AGAG I GGGG 




GTCAC TTACGCTCAGGCAGA 


loot 


T^O^X A^ AO/^/^X/^XOOP AX A 

1 GGG 1 AGACGG 1 G i GGGATA 


lobo 


G 1 AGGG 1 GAGGAU 1 GGGA 1 1 


AQPiCi 

Tooy 


A PPP/^^XA AXAOXr^X/^ A/^ A 

GAGGGGG i AA 1 AG 1 G i GAGA 


A QCir\ 
lopU 


GGG 1 GGA 1 AGGAG 1 GGGA i A 


A QdA 


XOXAPX/^XA/^XPP AXPOX/^ A 

1 G 1 AG 1 G 1 AG 1 GGA 1 GG 1 GA 




P X A X A P X A P A P r* X P P P A P P 

vj 1 A 1 Ao 1 UAoAoLr 1 ooGAGG 


A Qd^ 

Ipbo 


/^/^X/^ AOX/^A A/^XAXOr^/^ Ar^ A 

GG 1 GAG 1 GAAG 1 A 1 GGGAGA 


A QdA 

loo4 


APP AP APXA A AXAXPPXPPP 

AGGAGAG 1 AAA 1 A I GG 1 GGG 


lobo 


axapap/^Pa a/^p ax aoxx/^p 
a 1 agagggaagga i ag 1 1 gg 


1 ODD 


ATP Tr^ A PT A A P AT<^ A Tri P P 
A 1 O 1 oMO 1 AAOvjA 1 VjA I 


1867 


GtTGTAGGACGTATGATCTC 


1868 


TTAGTCGAGTCTATGAGCCC 


1869 


CGACGATACAGTAATCTAGC 


1870 


CTGATACAGGCATAGACATC 


1871 


GGTATCAGAGCTAGGACTAT 


1872 


TCTATCTCAGCTACGGTCGA 


1873 


TCAGTTCGATCTACGGCTAG 
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Seq. Id 


3' to 5' sequence 


1874 


TCAGTGCGACTCAGGTACGA 


1875 


GTCACTGCACTCACGGTAGA 


1876 


TAACGAGTCTTCAGCACGTA 


1877 


GAAGTCGCCTACATAGCCTA 


1878 


GAAGTCCGTTACATGACCAT 


1879 


GTCAGAGGATCGAGCCACTT 


1880 


GCGAGACAGGTCAGTACAAT 


1881 


CGTCAGAAGGCTCGCACATA 


1882 


GCATACAGGTTACGACGCCT 


1883 


GCGATACAGGTTCAGAGATA 


1884 


ggacgcatagctcgcagtAt 


1885 


GGACGCAGATCGCAGCATAT 


1886 


CGGCGTTAATCGCAGAGAAC 


1887 


CGCGTTCTAAGGCACGGATA 


1888 


CGCGTCGCAAGGCTGTTATA 


1889 


CGATACGCAAGGCTACGACA 


1890 


CATCTAAGGACACTACACTG 


1891 


TATCATCGAGGACTCAGTGC 


1892 


CACCGAGCAAGACTGACATG 


1893 


CGCACCCGAAGTCAGAGATA 


1894 


CGGCTAGGAAGTCAGCATAA 


1895 


atgctgcgaacgcgccataa 


1896 


CCGCGTGCAACGTGTTCATA 


1897 


GTCGCTGCATAGCATCTGAG 


1 898 


GTCTGTGCATAGAGGGTCAT 


1899 


GTGGTGTCACTGATACGTCA 


1900 


GGTTAGCAeTAGATCGCACT 


1901 


CGGGATCTACAGCAtCATAG 


1902 


CTGGATATACAGCACTCACA 


1903 


ATGCGGGTAACGCCTCATAA 


1904 


TCGCGGCGCACTCTGTTATA 


1905 


TCGTGCTAGTGGCACTGTAT 


1906 


TAGGACACTTCGCCACTATG 


1907 


TATGACAGTTCGCGCTACCG 


1908 


TC G C G C AGTT AG G C CT ATGT 


1909 


TAGCCACCGTAGCTGATCGT 


1910 


GTAACCCGCTATGAGATGGA 


1911 


AGAGCGGAACACCACATTGT 


1912 


AG G CT AAG AAC G C AC ACTCG 


1913 


G AG C CT AG AG AG CTTC AT AG 


1914 


y^ y^ A y^ ' i r A y^ A ^^T' A y^ a t 

GGCAGTTCACGACTCGACAT 


1915 


GGCCTTAGACGAGTCGCATA 


1916 


GGTCGATCAGCACTGCATAC 


1917 


GGAGAGTCAGCACAGTCCTA 


1918 


GTATAGGCAGCACGGCTCAT 


1919 


GCACGGCGAGCACTATCTTA 


1920 


TAACGTGGTGCACGATGTGT 


1921 


GGACGCCTAGCACATCTGAT 
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oci^. lU 


com lon^A 






i QO'k 


r^p A p A TPf^ A P A P Axri P AriT 


1 OOA 


r^P APr^APPAriPTPTTAPP AT 


1 yzo 


PPP APPAPAPAP AT AP A PPT 


1 yzo 


r^nr^r^ f\.nr^r^ t^f^r^ /\ atap atap 
OUOoAUoUAOoAA 1 AoA 1 Ao 




PPP APPAPAP ATA P ATP APT 




Lr 1 1 LrULfOOAoLr 1 ALrA 1 AoA I 


■1 QOQ 


PPpTPPP A APPXPPP at a AX 


lyoU 


PP PPPPPT A A /^/^ A aV^ A A ATT 

UoOOUUvj 1 AACjUAAUAAA 1 I 


1 yoi 


P/^ A PPr^TP A A O A /^"T/^ A "T A A 

OoAotjO 1 UAAULjAO I UA 1 AA 


lyoz 


uCjpOAOAU 1 AACjiCjiA 1 UA 1 1 AO 


1 yoo 


AP APAPPP A AP A APPTP/^PT 

AoAUAOoL/AAoAAoO 1 ooO 1 


iyo4 


PPAPPPATAPP AP APP ATpT 

oLrAUoOA 1 AoUAoAooA 1 U 1 


lyoo 


PPT APPTP APTP APP APP'AT 

oLr 1 AUo 1 UAO 1 oAoUAooA 1 


lyoo 


PTAP AXPXPPTP APP AP APP 

o 1 AUA 1 U 1 1 oAoOAUAoLr 


1 yj/ 


/^TAP APP APTTP AP APP A AP 
Lr 1 AOALfVjALr 1 1 VjAoAOoAA^j 


\ yoo 


PTA APTAPPTPP A APP A APP 
W 1 AAo 1 AOvj 1 oOAAoOAAoo 


1 yoy 


P AP APPTAPP AP APPTATPP 
oAL^AOvJ 1 AooAOAOL/ 1 A 1 


1 y'fu 


P AP ATAPTAP AP ATPTPAPP 
oAO A 1 Ao 1 Ao AOA I O 1 OAOo 


1 Q/li 


PAPAPPPTAP APATPr^TPAP 
oAoAoOO 1 AvjAOA I v/Vj 1 LrAVj 


1 y'fz 


PAPTATPAPPAPATTPAPPP 


1 y^o 


P ATPT APAPPPT APP APTPP 


1 y44 


PPTTAPTAPPP ATAP ATP AP 
oLr 1 1 ALf 1 AOoVjA 1 AvjA 1 OAij 


1 y'fO 


PPPTATPTA ATPP APTAPP A 


1 y4D 


P P P T A TTT A P A P TP A P P P A P 


1 y4/ 


PPPTATATPP A ATTP APTPP 
oUVj 1 A 1 A 1 UoAA 1 1 oAo 1 oO 


iy4o 


PPPTTP AP AP APTPP APP AT 

oUo 1 1 OAuAoAo 1 UOAUoA 1 


iy4y 


P/^PPtat/^ A appTP aPo aP A 


1 you 


PPT aTt"aPaPTPxP aP A/^ AP 
1 A 1 1 ALrAo 1 \j \ UAUAoAU 


1 yo 1 


PpTP AP ATA APPTP APTT aP 


1 yoz 


PPXPXPTP a APPT"/^ APPT a A 


1 yoo 


TATTAPP APTPPTP APP APP 


1 yo4 


ATPTTATP A APPTP A PPP A P 


1 yoo 


PPP ATAPTAP APTP APPP AT 
ooOA 1 AO 1 AoAvj 1 OAoOoA 1 


1 yoD 


APTPPPATAPA ATAPPAPPP 
Ao 1 oOoA 1 AOMA 1 AOoAoOo 


\ yo/ 


PAPPAPAPAP APT APAPPPT 
OAo^AL/AOAoAo 1 AoAoOo 1 


1 yoo 


PPTAPP ATA APPTPAPP APP 


1 yoy 


PTPPATAP APPTTP ATAPP A 
\j \ LrOA 1 ALjAOVj 1 1 VjA 1 AOOA 


1 you 


PPTAPP ATAP ATP APPP APP 


1 yo 1 


PPP APTAP APPAP ATPP AP A 
OooAOi 1 AOAOOAoA 1 OOAoA 


1 i7DZ 


PAPPPTATAPPAPATPPAAP 
OAVjV^v? 1 A 1 AOOAVjA 1 L^OAAw 


1963 


GACTGTAGAGAGACGATCCA 


1964 


CTAGTAGGAAGTGCGATCAA 


1965 


CGTAGAGGAAGTGATACTCA 


1966 


CGTATGGGAAGTGAGTATCA 


1967 


CTATGACGAAGTGAGAGTAC 


1968 


GTTCGTAGAGATGATGGTCA 


1969 


GTTCTCAGATAGTATGCAGC 
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Seq. Id 


3' to 5' sequence 


1970 


AGTCTGTTAAGATATGCGCC 


1971 


AGCACGGAACAGTAAGCCCt 


1 972 


ATCCAGAGAACGTGAGATCC 


1973 


GACAGTGTAATATGAGGACC 


1974 


CATAGTAGAAGATTCGAGCC 


1975 


TGAGATATAGTATGCGGCCA 


1976 


ATGAAGATACTATAGCGCGC 


1977 


TTCTCTATATCGTGCGCOGA 


1978 


rrGAGTTTACGTGTATGGCAC 


1979 


ACGGCATCAAAGTTGCATAC - 


1980 


AGGGGCTCAAAGTATGATAG 


1981 


AGGCGCTTAAATGTGGATAC 


1982 


CTGGCGTTAATGGCGGACAT 


1983 


CTGAGCCAATAGGCGCACTT 


1984 


TAGGGATGATGAGAGCTATC 


1985 


TGCCTATGAGGAGTATGAAC 


1986 


GGGGTATAATGAGCTTGACT 


1987 


TAGGCTTCATCAGCTATCAG 


1988 


ATTGCTTCAACGGGCATTAC 


1989 


TATGATCCATGCGACTCGGA 


1990 


TTGTATCCATCGGCCCAGTG 


1991 


ATCAAGGCAACCGCCAGTAG 


1992 


TCTGAGCCATCCGTG ATAGG 


1993 


TATCAGGCATCCGAGCATAG 


1994 


TTAAGCTCCTCAGTCCATGT 


1995 


TAAGGGCGATGAGCCTATCT 


1996 


TAAGGCCGAGGAGCTTTCAT 


1997 


TAAGGCAGTGGAGGGGTCTA 


1998 , 


TGGACAGGCTGCGCTCTATA 


1999 


CTGGAAGCCTGCGACCAAAT 


2000 


TCAATGCACtGAGCCCGAGA 


2001 


GATTCACACTGACCCATGTA 


2002 


TAAATAGATTGGAGACGCGG 


2003 


GCATTAGAAGGTCTGGACTA 


2004 


ATTGGCATAACGTATTGCGC 


2005 


CAGGACTGAAGATCGAGTAC 


2006 


TAGAGTGAGTGATAGCTCGA 


2007 


1 1 1 A I CG 1 AGO I GGC 1 GCCC 


2008 


AGGATTAGAACCTACGCACC 


2009 


GCCGTGAGACCACTGTACTA 


2010 


GACGCTGAATCCTATTGACA 


2011 


CGCCTAAGGATCGTGAAGTA 


2012 


CGACGACGAAGCTGCATGAA 


2013 


ACTCGAATAACAGCATCTCG 


2014 


CCCGTAAGCATGGCACAGAT 


2015 


CAATACAAGATTACGGCCTC 


2016 


GATCAGAATCTATGGTACGC 


2017 


TCTGTGTACTGCTCGCCAAT 


2018 


ATATTTGGAACGCAGCTCAC 
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Seq. Id 


3' to 5' sequence 


2019 


TGCAGTATGGCAGCGGTTCTA 


2020 


GGGCAATGTTTATCCACAGA 


2021 


CTGACCGAATCCAGCAGAGA 


2022 


GATCGTGAATCCGCGCACTA 


2023 


GAGCGGTAATCCGAGCGATA , 


2024 


TACTCCTGACGACTTAGGCA 


2025 


TGCTGTCACTCGGCGTCTAT 


2026 


GTACTAGCATATCATCGACG 


2027 


TATGGCATAGATCAGTGAGC 


2028 


TACGGGCAGGCAGGTACTTT 


2029 


GTTCATCACGAGtGCGTAGA 


2030 


GATGTATGAAGATGGCTGAG 


2031 


GGTCGCGGATTGGAGCATA 


2032 


GGAGATATCTAGGGACATCT 


2033 


ACGCGGGTAAAGGTAGATAG 


2034 


CACTGCCGAGAAGATGTAGA 


2035 


GGATTTAGATGGCGTAGGAA 


2036 


gatgagacagaatggaccgt 


2037 


agaggcataaatgagtctcc 


2038 


tgagtagtaggttaggggtg 


2039 


GGATAGGGAAGGAGTCCAGA 


2040 


ACACTCTGAAAGACGCGACG 


2041 


GTCTTAATGTTGGGGAAGG 


2042 


GTTATCGACTAGGGTGTAGT 


2043 


TCGTGAGACCGTCGTCAGTA 


2044 


GACAGCGCAGTAGAGGTAAT 


2045 


CGTAGAGTAAGTATGATGGC 


2046 


TAGAGCATGTGACGGTATGA 


2047 


GTCAGGATTAGTAGGCAGG 


2048 


TGGTACGTGTATTCAGGGCG 


2049 


TTAATCCGCTGTAGCCGAAA 


2050 


TTAATTGAGTTCGGTCCAGC 



In accordance with one aspect of the present invention. Tag genes were made by 
annealing and extending overlapping 23 to 192 oligonucleotides randomly chosen from 
the 20mer Tags or their complements from Seq. Id. Nos. 1-2050 asembled head to tail. 

In accordance with the present invention. Tag genes preferably comprise 5 to 
1000 randomly chosen 20mer Tags sequences from Seq. Id. Nos. Ir2050 or their 
complements. More preferably, Tag genes compriselO to 500 randomly chosen 20mer 
Tag sequences or their complements. Still more preferably, Tag genes comprise 20 to 
200 randomly chosen 20mer Tags sequences or their complements. 

In accordance with one aspect of the present invention, a Tag gene is incorporated 
into a vector having a first promoter sequences 5' to the Tag gene and a poly(A) tract 3' 



50 



PATENT 
Attorney Docket No.:3502.1 

to the Tag gene such that a sense polyA^ RNA is generated frorn transcription initiated 
from the first promoter; a second promoter sequence is located 3' to the Tag gene and on 
the opposite strand as the first promoter such that antisense RNA can be synthesized from 
the second promoter of the Tag gene. The choice of synthesizing sense or aiiti-sense Tag 
5 gene sequence will depend on the ability of the transcript to bind to Tag probes place on 
the nucleic acid array. In accordance with one aspect ofthe present invention, one or . 
more endonuclease restriction sites may also be incorporated into the Tag gene contructs. 

Preferably, in accordance with one aspect of the present invention, the first 
promoter is a T3 promoter. In a preferred embodiment the second promoter is a T7 
10 prornoter. Transcription can be performed either in vivo or in vitro, in accordance with 
the present invention. It is also preferred that the nucleic acid array is an Affymetrix 
GeneChip® Array. 

In accordance with one aspect ofthe present invention, sense RNA containing the 
Tag gene sequences and the poly A tail synthesized from the first promoter can be spiked 

15 into samples, containing for example mRNA, and subsequently hybridized (after 
labeling) to a nucleic acid array having appropriate Tag probes (i.e. probe sequences 
complementary to the Tag gene in question). With a nucleic acid array having the 
appropriate Tag probes, spiking can serve as a control for various aspects of the assay 
process such as variations in sample preparation, hybridization conditions, and array 

20 quality. In accordance with one aspect of the present invention, anti-sense transcripts of 
the Tag genes can also be used as control spikes for a nucleic acid array having 
appropriate probes. 

In accordance with another aspect of the present invention, the synthetic Tag gene 
DNA itself can also serve as spikes in applications involving genomics. For example, 
25 Tag gene DNA could serve as a control for PCR, including long range PGR, fragment 
labeling, sample preparation and as quality control for the nucleic acid array. 

The invention will be further illustrated, without limitation, by the following 
examples. 

EXAMPLES 

30 Example 1 

Construction of cloned svnthetic Tag Genes 
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In one embodiment, thirteen different Tag sequences of varying sizes were 
designed by randomly assigning 20mer GenFlex™ Tag sequences chosen from Seq. Id. 
Nos. 1-2050, set forth above, to groups, and orienting the sequences head to tail. 60mer 
oligonucleotides were designed to encode the desired genes as well as flanking sequence 
5 used for assembling and cloning the genes. The gene assembly with unpurified 60mers 
can be accomplished by polymerase extension of the annealed ohgonucleotides as 
depicted in Figure 1 and described in U.S. Patent Numbers 5,834,252, 5,928,905, and 
6,368,861 and in Stemmer et al. (1995) Gene 164:49, each of which is incorporated here 
by reference. 

1 0 Oligonucleotides, nucleotides, PGR buffer, and thermostable DNA polymerase 

are combined and subjected to temperature cycling. After about every 30 temperature 
cycles fresh buffer, nucleotides, and polymerase are added to replenish the reaction. 
Each oligonucleotide serves as both teniplate and primer, and because of the 
oligonucleotide design, the extended products continuously grow in a spiral of 

1 5 concatamers that can reach over 50 kb. 

Following assembly of the oligonucleotides into concatamerized products, 
monomers for cloning are prepared by digestion with restriction enzymes either directly 
or following amplification by conventional PGR with flanking primers. The digested 
nionomers are ligated to the plasmid vector pSPORTl (Invitrogen Life Technologies, 

20 Carlsbad, CA) (see Figure 2) and the constructions propagated in the E. coli strain DH5a. 
Subsequently two features useful in generating poly(A) sense RNA are added to each 
construct: a T3 RNA polymerase promoter upstream of the gene, and a poly(A) tract 
downstream of the gene. The 13 genes constructed are named TagA, TagB, TagC, TagD, 
TagE, TagF, TagG, TagH, TagI, TagJ, TagN, TagO, and TagQ. Two additional 

25 constructs, called Big Tags, were made: TagI and TagN are combined to make TagIN, 
and TagI, TagN, TagO, and TagQ are combined to make TaglQ (see Figure 3). TaglQ is 
then altered by site-directed mutagenesis to add two restriction sites, EcoRI and Xbal, 
and the resulting construct is named TaglQ.EX. These additional restriction sites make 
construct TaglQ.EX useful for as a genotyping assay control (see below). Fluorescent 

30 dideoxy DNA sequencing was used to determine the sequences of all the constructs, 

which are shown below, Organization of a synthetic Tag gene and flanking sequence in 
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the Tag gene clone is shown in Table 1 below. The actual sequences of synthetic Tag 
genes and flanking sequence in the Tag gene clones are shown in Table 2. The T3 and 
T7 RNA polymerase promoters and the poly(A) sites are underlined, and the Tag 
sequence is in CAPS. The DNA sequence shown is the sense (Tag) strand. The length of 
5 each Tag sequence is given. 

The sizes of the Tag sequences in constructs TagA through TagQ ranged from 
467 to 1000 bp, with a total of 9808 bp; the TagIN construct has 1944 bp, and TaglQ has 
3849 bp of Tag sequence. There are a total of 78 base pairs different from the designed 
sequence, a rate of 8 bp per thousand; these changes are fairly evenly distributed and 
10 probably arose from polymerase errors made during the assembly and reamplification 
reactions. There are in addition 3 deletions of 12, 36, and 90 bp, the latter two of which 
are caused by the introduction of ari unexpected restriction site that led to truncation of a 
gene during cloning. The synthetic Tag sequence in the plasmids does not appear to 
affect bacterial grovvlh, and the plasmids are stable. 
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Table 1 

Organization of a synthetic Tag gene and flanking sequence 
SphI recognition site - T3 promoter - spacer - TAG GENE spacer - (A)21 - PstI 
recognition site - spacer - T7 promoter 



SphI T3^ TAG GENE ^ 

gcatgc aattaaccctcactaaagg gacgcgtacgtaagcttggatcctctagaXXXXXXXXXXXXXXXXXX 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
25 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

XXXgtcgacccgggaattccg gaaaaaaaaaaaaaaaaaaaaa ctgcaggcgtaccagctttccc tatagtgagtcgtatta 

poly(A) PstI ^T7 

30 Table 2 

Determined sequences of the synthetic Tag genes 

TagASOlbp 

gcatgc aattaaccctcactaaagg gacgcgtacgtaagcttggatcctctagaATTTGATCGTAACTCGGGT 
GACCAATGACCATATACGGCGTATTAAGGTTGTACCCTCGGTCTCAACTTGTC 



53 



PATENT 
Attorney Docket No.:3502. 1 

GTATGGGACTTTCAAGTACCTTAGCTCGTCGGACGCTTTAGATGACTTATCCA 
TAGTCCTAAGTCCGGCGCCGGTTAAGCCGCTATTAGCGTGTGTGGACTCTCTC 
TAGGAGCGGCTTCGCACAAATTACTGCTCAATCCTAGATACGTTGCGCTCTTT 
GGTAAACGGCTCAGATCTTAGCACTCGTGCAGTTCTACGATGGCAAGTCGTG 
5 CCTCGTTCTCGTGTAGAATATCAGCTAATAGGGTCGGCTCAACAGTGTATCCG 
GTGGACAAGCACTGACACGCGATGACGTTCGTCAAGAGtCGCATAATCTCAG 
AATCCGTACAGCCGCATCGGGTTCACGGCTATAAAACAGCGTCATCAGCGTA 
GGGTATCGCTTCGCGTGTCATGACTTGGGCCACGTCTCTCTCTCGCACATTAG 
GCTAGATTgtcgacccgggaattccg gaaaaaaaaaaaaaaaaaaaaac tgcagcgtaccagctttccc tatagtgagt 
10 cgtatta 

TagB467bp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaTTTAGTCGTTAGCCCGAGC 
TTAACTATTAGCGTCGGTGCTATATCCTTACCGCGTATGGAGTAGCCTTCCCG 

1 5 AGCATTTGTCTACCTTACCGTCAAGAAAACCATCGACTCACGGGATATTGACC 
AAACTGCGGTGCGATTAACTCGACTGCCGCGTGAACAACGATGAGACCGGGC 
TAAGGCACGTATCATATCCCTAATTCGCTGAATAGTGCCCTACATATCCTAAT 
ACAGGCGCGACGAACCTTATACTCGATGGAAGACAGTTATACCCATGCATAA 
AGCTCTATACTCCGAGAACTAGCATCTAAGCACTCGGCTCTAATGTTAAGTGC 

20 TCGACCACAGATCGAAGGTCGGAACTCCAGTGCCAAGTACGATGGCTCACGT 
CTTATTTGGGCCGCCAGAGTTATGTTTGAGTCTTCGATGTATGCGCTCGTTGC 
CCTATTGTTGTGTCGGATCTTCTAGTTgtcgacccgggaattccg gaaaaaaaaaaaaaaaaaaaaa c 
tgcaggcgtaccagctttcc ctatagtgagtcgtatta 

25 TagC579bp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaTGTGATAATTTCGACGAGG 
CGTTACATATTCTGAGAGGGGTGATTAAGTCTGCTTCGGCCTGGGATGGTCTG 
TCTACGTGTGCGTAGTTCTGTCATAGCGTCGAGGATTCTGAACCTGTCCATAG 
TATCCTGTAAGCGTCCAATGTACCTATATCGTGGACCCAAAGTCGATACGTCC 
30 GATTAAGCGACGTTGGTCTAGGTAACGAATTATACCCTCGGGTTACGAATTAT 
GGCTGTGCCTAACGAATCTGGGACGTGCCTAAGTAATCTGGTCCGCGACTAA 
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GATGTACGGTGATCGTGGACGCTTGACCGGACTTATGCGTCGCCTTCCGAGTT 
ATTGGATGGCGTTCCGTCCTATTGGATACTATTCCGTGCGTGTGCGACACGTT 
CCGAGCATATGCTAACAGTTCCGTCACTATGTAACGCTTGACGTAGATTGCTA 
TCAGGTTACGATGACTGCTAAGCCATTACGCGACATTCTGCAAAGTTACGTCG 
5 CATTCTCTCACGTTACGGCTGATTCTCTAGGCTTACGCGCATGAGCTCTAGGT 
TCCGGGTACTATCGAACGTGTCATTGGTACTgtcgacccgggaattccg gaaaaaaaaaaaaaaa 
aaaaaa ctgcaggcgtaccagctttccc tatagtgagtcgtatta 

TagD519bp 

10 gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaATAGACTAGCCTGCCGGTC 
AATAACTGATGACGCGGAGTCAACCTGATAACCCATAGCGGAACAGTCTAAC 
CTACGCGAGATACGTCTTACCGCACATAGGTAACCTATTCGTGACTAGCAGG 
CCTTATTCCGGTGCTATGAGTATCTTACCTGGTCTAGGTATCTAATTCGTGAG 
TCGGGTACTACATTCGTGCGATGGGTCCTCGCTTCGTCTATGAGGTCTCGTCT 

1 5 TGGTGAGTGC AATGTATCCGAAGTCGTAGTGATAATATGGAACTAGGCGCGA 
TTTGACGAACGTATGCCGCATATTCGGAACGTCGCCTGGAAATTCGCCACCTA 
GATCGAAATTATCGGAACTCGTCGCTTATTTACGAACCTTGGGAGCCGTTCCT 
AAAGCTGAGTCTGGTTTCTTATTAGCGAGGAGCATTTCGTGAATACTGAGCCG 
AATATCGTAAGACATCCGCGAGCGACTGTAAACTAATCGGGGAACTTATTAT 

20 AGAGCCGGTCCAGGTCTTGAACGACGTgtcgacccgggaattccg gaaaaaaaaaaaaaaaaaaaa 
actgcaggcgtaccagctttccc tatagtgagtcgtatta 

TagE578bp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaCCATCCGATTAAATACCGT 
25 GGATTACGTTAAGTTACGGCGGTTGACTTAGTTATGCGAGGTTCGCTTACGTT 
GCATAGCGGATCGCTTAACCTCTATGCGTACAGCTTACCTACTATGCGTGCAA 
GTTACCGAGCTGACGTCGCGTTAGACAGCTCATTCGTCACGTTTAGGACTATG 
TCGAAGGGTTTCGACCATGTCGTCTAGCTTAATACCTCTGCGTCTCAGTTAAT 
AGTACGGGCAATCCGTTATGTAAAGGGTGACCACGTTTCAGAAGCTGCCATA 
30 TACTTACACACjCAGGCGATCACGTTAGATCCACTGCGTCACGTTACCTACATG 
ATCGATCCGATTACAGGCCGATCCATCGGATTACACACGAGTCCTGCACGTT 
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AGAACACTGGCTCGCGGCTTAGATCAGCTTCCCTCGCTGGAGATCGAATACG 
CCCAGCTWAGAGCGAATTGCGGCGCGTTCGACATAATTGCCGACGCTTCGAC 
AGAATTGTAGGCGATTCTAGCCAATTGCACGTCGTATTAGGTAGTCACTCTCG 
ACCTAGCGTAAGGATCCACGATCCTAGAGTCGGgtcgacccgggaattccggaaaaaaaaaaa 
5 aaaaaaziaaa ctgcaggcgtaccagctttccc tatagtgagtcgtatta 

TagF660bp 

gcatgc aattaaccctcactaaagg gacgcgtacgtaagcttggatcctctagaACGCGGTCACTCAGCATAT 
AGTCGTTGCACCTAGTTGATAGTCGCCGATTCTAGTTATGGCGTCGGATTAGA 

10 CCGGATCACCCGGACATGGACGTTAAGTATCCGGCCTGGACGACAATAATTC 
GGCGGTGCCTCACAATATTCCGAGAACTCTGCATCAATTCGGGCTAGTCGTAC 
CTGAACGGGCATCAGTCGAATCTCTTCGTGGCTAGTCTGTGACGTCCGTGGTT 
CATCGTGTCACCACGCGGTACATGAGTCAAAGTCCGAATAGCTCGCGCAACG 
TCCGTCTAGCTGGATCAACCTATCCCTGAGTCTATATGCGTACCAATGGATGC 

1 5 GGTCTCCTCCGACTGAGTATGCGTTGCTCGGACTGGATC AGCTATCC ACGAGC 
TGTAATCCGGTACTAGGGTGTATCGCCTGTTACTAGGTTAGACAGTCGTGTAC 
TCGGTTAGACTGATGGTCAACGACCTATACTGACAGCATACGAGACGTGACG 
ACTGCATAGTGGTCGGTCTGACACATCTCCTCGTTGGTAGTACGTGCCCCGTA 
TGGATAGGGCTCTAGCCCGCTATGGTGAGTCTAATCGCCGTTGGTCTGTATGC 

20 AGTGCGGTATGGTTCCTCTCAGTCACGTATGGTTCGCTGCTGTCCGTCATGTG 
TTAGATGCgtcgacccgggaattccg gaaaaaaaaaaaaaaaaaaaaac tgcaggcgtaccagctttccc tatagtgag 
tcgtatta 

TagG 760bp 

25 gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaATGCAGCGTAGGTATCGAC 
TCTCACTGTGGAGTCGTCTATGATGTCGTGGAGTCCTCTCAGAGTGCTGTAGG 
TCCTCATAGGTCGTGCTGTCTCTCTACACGCGTGCGTGAGTCTACATTTCTGC 
GAGTTGGTGCTCTCACTGCGGTGTCAGTGATCTCTCCGCGTGTGACATGAGTC 
TAGCTTCGCGGTCATGGTCTATCCCAGCGATGGATGAGACTACTCTGTACTAG 

30 ATGGTCATGCCTGCGAATGAGTCGTCAGTGCCCACAATGTCTCGATAGTGCG 
CCGAATGTGTCTGTAATGCCTCGAATGTGTAATCGTCAACTCGTATGTGAAGT 
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GCTAGGCTAGTATTGACATCTACGGGCGGCTATTGACGAACTCTCCGGTATAT 
GCTCTACATCTGCAGGGAATTGCCGACCATATATGGGTCTTGCTGATACGCTA 
GGGTGCTTGCTACTTAGATAGGCGTCTTGGCCGCTATTCGCGGCGTGTCTCAG 
AATATGCGCGACGTGTCTGGTATATGGCGACTGTGTCCGTCTATACGCATACT 
5 GGTCCACATATAGACATACTTCCACGACATGACAAAGCGTGCTCCTACATAG 
CACGAGCGTCTCCTAAATAGATCCGGTCTTATCGCTGAATGTCTAGGATTCTC 
GTCAATGATCTACGATCCTCGCTAAGTATTCAGCCACCTCGTATAGTATTCGC 
GCACCTGAGGATTTATTCACCTGACTCGCGTATAATATGCCGTCACCTAGTCT 
Agtcgacccgggaattccg gaaaaaaaaaaaaaaaaaaaaa ctgcaggcgtaccagctttccc tatagtgagtcgtatta 



TagH 848bp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaGATATGCGTTACGTGAGTC 
TGATAGCAGTTCACTACCTGGATATCTGATCCACTAGCTCGATCATGCTCACC 
CATAGTTTATCTGCATCACTCGTACTGAAATGCTCACATCGCAGGTAGAGCAG 

15 CATCGTAGAGCGTCAAGCTGCATCGTAGCGTCATGAGTCATAGTACCTCATGC 
TCACGTGATCT ACCCTAGCTGACCGCTAATGACGGC AGTGC AACCTGAGATA 
GCGACGGCATACTGTCGTCAACGTCAGGCAATGTGTCCGAACGGCGAGCTAC 
GTCGCCTCACGGAGTAATCGCGTCCCTCTAGGTATAGTGCCGTCGGTTCAGGT 
CATATGTCGCGGGTTCTGCACATATCACGGACGTATCGCTATCAGACGGACG 

20 CTCTCGGACCTAAACCGTAGCTCTCGGCAAGATCGTCCTCGTCTCGAATATAG 
CGCCCTAGTGCTGCAAATGTCACCGCTATCTCGTAAGGGGTCCGTCTGTTGAG 
TTAGGCCTCCTCTCGTTGGATGTGAGCTCGGTTGCTTGGATGGTGCAGCTTAC 
TTCGCGTACCTGCTGTTTGCATCAGTCCTCTGCATCTATAATCGCGTATCTCTC 
TCTAGTAGACCATATAGCCATCTAAGCGCTCGATATTCCACCTAAGTGGCGCC 

25 TATTGAACTAAGTGGCAGCCGAATGGACTATCGCTCCTCGATATGTACGGAT 
AGGCCACGGCATGTACGAGCATAAGCCGAACTGCACGAGCATACCCGACACT 
GATCTGAGAGTCGCTTAAATCATCTGCGTGTCTTAGAGCTTATCGCCATGTCT 
GTCAACTGTACTGTCATCCTGTAACTGTAGCGTATGTGgtcgacccgggaattccggaaaa 



10 




30 



TagI 940bp 
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gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaGATAAGCGTTCACAGCTCG 

gcaatacctgtgacgagctgctcgcaagatttacgcagtgtggctatacttg 

acagtgatggcgcttacttcagatgtatgggtgatacttcgctatatgggtgg 

tcacttctctatggcgcgtgacaatgtactatggagcggtcaatgtcagtacg 

5 gatggcgtcgatctaggtgactacgcacgcctctggagtaaatcgarwgctc 
cgtck:gaaatacgcggtcatcgtgcgaataaccgagtcatcgtgagtagtat 
gaacgtgtcgtgttatgcagcggtatgtcgtgctataatggcgtctgtcgtgc 
tcataaggttcctctgatgtgctagacgtgtccatcgagctgcatagctatac 
ttcgagtcacttgggatacttcgatagcgttgtgaatagtgtcgtaggctctc 

10 gggcacgttgytaaactgttgccgccaattcaagattagtccagctcgtacta 
tcgaatacaccatcgtcgtatcgaataatcgcacctcgtaggagtcagttgcc 
actcgttgatagtcaaccaagctcgttagatagtagcccagatcctacgaga 
tgagctacgtaactacagtgatagcatatagggtacgctagaatgccaggtc 
gtagtcgaattagtcaggttggatgtctactagttgacttggagtatgccatg 

1 5 aagactcgtccctcgatatc aatactcgtccgcaggtgaac actgtagtcggt 
gctagtgcccacttctcggtatgtgtcctcaattatcgagtaggattctaatc 
aatcgtcgcggctcactaatygtctgcggtggctactaatggttacggtgcct 
gactaatcgtgtaggtgtgtaatacatcgtgatacgggcgatataatgctcg 



20 tgcaggcgtaccagctttcc ctatagtgagtcgtatta 
TagJ960bp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaCAATGATAGGCTAGTCTCG 
CGCAGTACATGGTAGTTCAGCCAATAGATGCCTAGTACGCTGACGGCATTCA 

25 GAGTACGCTGATCGGCTTATGACGTATGTGACGCAGCTCTTAGCGCAATGTAT 
GTGCTGTTATCGAAGCCTATGGCTGAGTATGTAACGCTATGGCGTGCTAGTCG 
TCTCATATACGTCTGATGACCTCGTATCATGTTATAGGGCTGCGAACTGTCGA 
TGATGGTCACGACTCTGTCGATAGCTGTGTGACTCATTCAGAAGGTGTGCAGC 
CTATATGATACGCAGTCGCATCCTATCTTACGTGTCAGTACTATGTGTGAGTG 

30 CTCCGCCCTAGTGCTGATGTATGCCCCATAGTGCTCAGTGGAGTCTCTCTTAG 
CATAGTGTCCGCTCATACATTAGATGGACGGCTCATTAGTATCATCGTCGGCT 
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GATATAGGTCGTGGCTCCCTGTATATCGAGGTGAGTCTATCTGGATCAACGTC 
GCACTATGATGTGCAAAGTGTCGTCCATGTATAGACAGTGCGCGTATCATAT 
AGGATGCGGCGATCTCATACAGCGTTACGGTCGCTGCGTACTGTATAAGGAT 
GCTCTGTGAACTGTCATCGGTCCGATCAATTAGTCTAGTGTGCGTTATTCAGA 
5 TCGAGTGAGTACATGATTCGTCAGTGTGGATCAATTACAGTTAGGCCGCTGA 
CACATTAGTAACGTCGGCAAGCACTTAGTCGTGTCGTAAGCCAGTGTGTCGT 
GTCTTAGACGACTGTGTGTGATTCTCGAGCGATTTATACATCCGTGACAGCGT 
TTATAGTGTGCTGACAGACTGGTTGGTTATCCAATGATCGACCTGGAGTCTAA 
TATCTGACCACGCCTTGTAATCGTATGACACGCGCTTGACACGACTGAATCCA 
1 0 GCTTAAGAGCCCTGCAACGCGATATAC AGGCGCTGCT ACCGATATgtcgacccggg 
aattccg gaaaaaaaaaaaaaaaaaaaaac tgcaggcgtaccagctttccc tatagtgagtcgtatta 

TagN998bp 

gcatg caattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaAGATCGCAGGGTATCGCAT 

1 5 CGACAGACCTGGTATCGTCGTGACGAACGTGCTACTCGCTTATCGGGCCTGCT 
ACATCAGTGGCGATGTTCGTAACCCTTAGCCGATCTTCTTACTTACGAGGCTA 
CTATTCGATCAAACTCGCCTATCTGGTAATAACTGCGGTGATCTGGTAGCCAC 
TACGTGCGCCTGGTAGCAAATACGGCGAGCTGGTATCACTATCGGCTCAGTG 
GTCCGACATAGTGCCCAGTGGTTCGCATAACTGCCGCTGGGTCCAATATAAC 

20 ACGCAGTCGTCAATCATACGAGCCGATGGTCAGCAATAGCGCCTGTGGTGAC 
ACTATGCCACCTCTGGTCTAATATAGCGCCCTGTGGTCGTATAATCGAGCGCG 
TAATCGTATATYCGACTGTAGGTGCGTAACTCGCGACTAGGTGGCTCTAATCT 
GCGTTGGTTGTCGCTCACAGTGTCTGGTGTTCGATACCCGGATCGGGTTCCGT 
AATCTTGGCATCGAGGTTTCGTACATGTCACGCGGTCTCGTTCATTCTCGGTG 

25 GTGCTCAGTACATCCAGTGGTGAGTCGCTACATCACACGGTGATCCGGCTAA 
ACCTCTGGGCATCCGTATTAAGCGACATTCCTACGACTTATCAGCACGTCCTA 
CGGTATAACAAGGCGTGCTACGGTCTAACGACGCTGGTAGCAGTCTATCAGA 
TCGCTAGTACGAGTTAGAGATGCTTAGTACGCCTTCGAATCTATGATGCTCGT 
GCTCACGCGATGCACTCGGATTATGGCACATGCACTCGCGTAATGACGCTGC 

30 ATCGCTCAGTATGATCCATGAGCGCCGTGAATGACGCATGAGCCTCGTATCG 
AGTGCATGAGCTGTCTTTCACATGATACATCGCTCTAAATCATCATGCGACAG 
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TCTCGACAGCAGCTCAGCATCTATGCATCATGTGCCTCACTAGGACATCATGC 
TCGACTCTGAGACACTGATCGAGCATTAAGACgtcgacccgggaattccg gaaaaaaaaaaaaa 
aaaaaaaac tgcaegcgtaccaectttccc tataeteagtcgtatta 

5 TagO 998bp 

gcatg caattaaccctcactaaagae acgcgtacgtaagcttggatcctctagaCTCTGTGTCATGATCGTGA 
GTTGTCGCAGTGTCTGTACCAATACTCTGGTGGAGCTATATAAGCCGCTGTTO 
CGTAAATCAACGGCATGATCCCTATGACCGCGTCATGCTAACTGATACACGC 
TGCTCGAACAGTGATACGCACACTGATAACTATGCGCAGACGCTTGAAACGA 

1 0 TGTGAC ATCGCTTCT AGAGTATG AGCCGCAATGC ACGACTGATACTCGATAT 
GAGCAGCAGTCGGCTATGATTTGCAATGCTTGCAGTATGTATCCTGATCGTGC 
GTGCGATGTCTGATAATACGCTCGCATGATATGTATTGCGCTCAGATGCTGGA 
GATATGCCATGCGTGGTGTCAGTATGCCATGTATGCTGATATGTCGCGATCTA 
TGTGGTGACTATGAGATCCATGTGATGACGTTGCAGTCTCTGTGACCTTATCG 

1 5 ACGCGC ATGTGAGCCTAT AGAC AGCGATGTGAGCACTCTC ATCTGCGGATC A 
GTCTATCCTCGCTGATGCTCAGTGATACACGCTGATGCACGTAGTGAGCATCC 
TGTGCTCGCATATACCGCTGCTGCACTGATATGAGCCAGTGCTGCTGCTCTCT 
■ ACGGAGTGTGCTCGGCTATAACAGCGAGTGCTACGCCTAAACTGGCTGTCTA 
GCACTGTAGCTGGTGCATGTACTCGACTGCCGCTGCATCTACTATAAGACTCT 

20 GACATTAGCGTATAGGCTGATACATTAGCTCGGATGCTATCAGCTTGCGCCTA 
TTATATGCCTGACGCGGGATCTATCAGAACGACTCGGTAGCTCATATACTGG 
ATCACGGTGCCACAACATGCTACACGAGGTCTCAGACTCTATCCCGTGGACT 
CAACGTGCATCTGCTATGCTGAGCGCGTATCTGTGTACCTGTCCGATGCTCTG 
ATCTACACTGCCGTGATCGTTATATGACGAGACTGTGCGCTCATAGCCGACAC 

25 TGTGCTCGATAAGACCACGCTGTGCGGATATAgtcgacccgggaattccggaaaaaaaaaaaaa 
aaaaaaaac tgcaggcgtaccagctttccc tatagtgagtcgtatta 

TagQlOOObp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttggatcctctagaCTAGTGCATCCTCGTGGCA 
30 TCATGCGTCTCCTCAGTAGGTCTGCGACTGATCCTAGTGCAATGCGTCTGAGC 
CTGAGCTACAGCGATATAGCCTGGATTGTGAGCGTATTTGCTGTCAGAACCTC 
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AGCTCATCATGTATGATGCTGTACCATCCTGCGATACTGAAGATGCACCGCTA 

TAATGCGAGGCTCTCCGCTAAAGTGGAAGCTGCTCGTTCTCAATGCGAGCGA 

GTCGAATCCAATGCCGTAGCTGCGATAAGGATGCCGCTGACTCTACGGTAAT 

gcacgatcctctacattgatagcagatagtctaacgggatagcataggtgca 
5 aggctcctagcatgtaqtcacaggtgctc agatatagtcatcgctgcaatca 
gctagtcatcttgtcaggatgctactcactgcgtgcagaagattcgcacgact 
tcagaggatggcactcgtcattagagtgatgttctcggatcgacagtgctggt 
ctgcgaatgactcgcattcactaacatggagcatcgttatctaaaggggatg 
cacgttatcgtcgagtggccgtcatgtctatgcagtgcggcctatgtctcatt 
1 0 agcgagtcgtatgtatc atgtcgggctcgaatgttgc ac acgtctgggtaatg 
gtgaccgctagtcccasatggtgcttcgtagccacaaatgtcgttaggtagac 
cgacgttatcgcgctatacccgatgtcaacgcgagttagaccgtatcgtcccc 
agtgccctaagatggtcaagcgtgctcctacgttagtatcagtttccctattg 
gtacgtctggcgtacttctgaaacgtgatgggcggctggttacccgtatatgg 
1 5 gctcggttgacctctattgggcgttgttgacccgaattcggtatcctcgtcgt 
taaatggcgaacgtcgtctgctataggcaaacgtctgtcggtcatggcaaat 

GTTACTCGTGTGTGCAAGAAATTACTCGCTGTCgtcgacccgggaattccggaaaaaaaaaaaa 
aaaaaaaaa ctgcaggcgtaccagctttccc tatagtgagtcgtatta 

20 TagIN1944bp 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttGATAAGCGTTCACAGCTCGGCAATAC 
CTGTGACGAGCTGCTCGCAAGATTTACGCAGTGTGGCTATACTTGACAGTGAT 
GGCGCTTACTTCAGATGTATGGGTGATACTTCGCTATATGGGTGGTCACTTGT 
CTATGGCGCGTGACAATGTACTATGGAGCGGTCAATGTCAGTACGGATCGCG 

25 TCGATCTAGGTGACTACGCACGCCTCTGGAGTAAATCGAGTGCTCCGTGCGA 
AATACGCGGTCATCGTGCGAATAACCGAGTCATCGTGAGTAGTATGAACGTG 
TCGTGTTATGCAGCGGTATGTCGTGCTATAATGGCGTCTGTCGTGCTCATAAG 
GTTCCTCTGATGTGCTAGACGTGTCCATCGAGCTGCATAGCTATACTTCGAGT 
CACTTGGGATACTTCGATAGCGTTGTGAATAGTGTCGTAGGCTCTCGGGCACG 

30 TTGTTAAACTGTTGCCGCCAATTCAAGATTAGTCCAGCTCGTACTATCGAATA 
CACGATCGTCGTATGGAATAATCGCACCTCGTAGGAGTCAGTTGCCACTCGTT 
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GATAGTCAACCAAGCTCGTTAGATAGTAGCCCAGATCCTACGAGATGAGCTA 
CGTAACTACAGTGATAGCATATAGGGTAeGCTAGAATGCCAGGTCGTAGTCG 
AATTAGTCAGGTTGGATGTCTACTAGTTGACTTGGAGTATGCCATGAAGACTC 
GTCCCTCGATATCAATACTCGTCCGCAGGTGAACACTGTAGTCGGTGCTAGTG 
5 CCCACTTCTCGGTATGTGTCCTCAATTATCGAGTAGGATTCTAATCAATCGTC 
GCGGCtCACTAATTGTCTGCGGtGGCTACTAATGGTTACGGTGCCTGACTAAT 
CGTGTAGGTGTCTAATACATCGTGATACGGGCGATATAATGCTCGATACGGC 
AAATATAGCTCCGTCCGGTGGATCCAGATCGCAGGGTATCGCATCGACAGAC 
CTGGTATCGTCGTGACGAACGTGCTACTCGCTTATCGGGCCTGCTACATCAGT 

1 0 GGCGATGTTGGTAACCCTTAGCCGATCTTCTTACTTACGAGGCTACTATTCGA 
TCAAACTCGCCTATCTGGTAATAACTGCGGTGATCTGGTAGCCACTACGTGCG 
CCTGGTAGCAAATACGGCGAGCTGGTATCACTATCGGCTCAGTGGTCCGACA 
TAGTGCCCAGTGGTTCGCATAACTGCCGCTGGGTCCAATATAACACGCAGTC 
GTCAATCATACGAGCCGATGGTCAGCAATAGCGCCTGTGGTGACACTATGCC 

1 5 ACCTCTGGTCTAATATAGCGCCCTGTGGTCGTATAATCGAGCGCGTAATCGTA 
TATCCGACTGTAGGTGCGTAACTCGCGACTAGGTGGCTCTAATCTGCGTTGGT 
TGTCGCTCACAGTGTCTGGTGTTCGATACCCGGATCGGGTTCCGTAATCTTGG 
CATCGAGGTTTCGTACATGTCACGCGGTCTCGTTCATTCTCGGTGGTGCTCAG 
TACATCCAGTGGTGAGTCGCTACATCACACGGTGATCCGGCTAAACCTCTGG 

20 GCATCCGTATTAAGCGACATTCCTACGACTTATCAGCACGTCCTACGGTATAA 
CAAGGCGTGCTACGGTCTAACGACGCTGGTAGCAGTCTATCAGATCGCTAGT 
ACGAGTTAGAGATGCTTAGTACGCCTTCGAATCTATGATGCTCGTGCTCACGC 
GATGCACTCGGATTATGGCACATGCACTCGCGTAATGACGCTGCATCGCTCA 
GTATGATCCATGAGCGCCGTGAATGACGCATGAGCCTCGTATCGAGTGCATG 

25 AGCTGTCTTTCACATGATACATCGCTCTAAATCATCATGCGACAGTCTCGACA 
GCAGCTCAGCATCTATGCATCATGTGCCTCACTAGGACATCATGCTCGACTCT 
GAGACACTGATCGAGCATTAAGACtctagagcggccgccgactagtgagctcgtcgaccccgggaatt 
ccg gaaaaaaaaaaaaaaaaaaaaa ctgcaggcgtaccagctttccc tatagtgagtcgtatta 

30 TaglQ (INOQ) 3849bp 
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gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttGATAAGCGTTCACAGCTCGGCAATAC 

CTGTGACGAGCTGCTCGCAAGATTTACGCAGTGTGGCTATACTTGACAGTGAT 

GGCGCTTACTTCAGATGTATGGGTGATACTTCGCTATATGGGTGGTCACTTCT 

CTATGGCGCGTGACAATGTACTATGGAGCGGTCAATGTCAGTACGGATCGCG 
5 TCGATCTAGGTGACTACGCACGCCTCTGGAGTAAATCGAGTGCTCCGTGCGA 
AATACGCGGTCATCGTGCGAATAACCGAGTCATCGTGAGTAGTATGAACGTG 
TCGTGTTATGCAGCGGTATGTCGTGCTATAATGGCGTCTGTCGTGCTCATAAG 
GTTCCTCTGATGTGCTAGACGTGTCCATCGAGCTGCATAGCTATACTTCGAGT 
CACTTGGGATACTTCGATAGCGTTGTGAATAGTGTCGTAGGCTCTCGGGCACG 
10 TTGTTAAACTGTTGCCGCCAATTCAAGATTAGTCCAGCTCGTACTATCGAATA 
CACCATCGTCGTATCGAATAATCGCACCTCGTAGGAGTCAGTTGCCACTCGTT 

GATAGTCAACCAAGCTCGTTAGATAGTAGCCCAGATCCTACGAGATGAGCTA 
CGTAACTACAGTGATAGCATATAGGGTACGCTAGAATGCCAGGTCGTAGTCG 
AATTAGTCAGGTTGGATGTCTACTAGTTGACTTGGAGTATGCCATGAAGACTC 

1 5 GTCCCTCGATATCAATACTCGTCCGCAGGTGAACACTGTAGTCGGTGCTAGTG 
CCCACTTCTCGGTATGTGTCCTCAATTATCGAGTAGGATTCTAATCAATCGTC 
GCGGCTCACTAATTGTCTGCGGTGGCTACTAATGGTTACGGTGCCTGACTAAT 
CGTGTAGGTGTCTAATACATCGTGATACGGGCGATATAATGCTCGATACGGC 
AAATATAGCTCCGTCCGGTGGATCCAGATCGCAGGGTATCGCATCGACAGAC 

20 CTGGTATCGTCGTGACGAACGTGCTACTCGCTTATCGGGCCTGCTACATCAGT 
GGCGATGTTCGTAACCCTTAGCCGATCTTCTTACTTACGAGGCTACTATTCGA 
TCAAACTCGCCTATCTGGTAATAACTGCGGTGATCTGGTAGCCACTACGTGCG 
CCTGGTAGCAAATACGGCGAGCTGGTATCACTATCGGCTCAGTGGTCCGACA 
TAGTGCCCAGTGGTTCGCATAACTGCCGCTGGGTCCAATATAACACGCAGTC 

25 GTCAATCATACGAGCCGATGGTCAGCAATAGCGCCTGTGGTGACACTATGCC 
ACCTCTGGTCTAATATAGCGCCCTGTGGTCGTATAATCGAGCGCGTAATCGTA 
TATCCGACTGTAGGTGCGTAACTCGCGACTAGGTGGCTCTAATCTGCGTTGGT 
TGTCGCTCACAGTGTCTGGTGTTCGATACCCGGATCGGGTTCCGTAATCTTGG 
CATCGAGGTTTCGTACATGTCACGCGGTCTCGTTCATTCTCGGTGGTGCTCAG 

30 TACATCCAGTGGTGAGTCGCTACATCACACGGTGATCCGGCTAAACCTCTGG 
GCATCCGTATTAAGCGACATTCCTACGACTTATCAGCACGTCCTACGGTATAA 
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CAAGGCGTGCTACGGTCTAACGACGCTGGTAGCAGTCTATCAGATCGCTAGT 
ACGAGTTAGAGATGCTTAGTACGCCTTCGAATCTATGATGCTCGTGCTCACGC 
GATGCACTCGGATTATGGCACATGCACTCGCGTAATGACGCTGCATCGCTCA 
GTATGATCCATGAGCGCCGTGAATGACGCATGAGCCTCGTATCGAGTGCATG 
5 AGCTGTCTTTCACATGATACATCGCTCTAAATCATCATGCGACAGTCTCGACA 
GCAGCTCAGCATCTATGCATCATGTGCCTCACTAGGACATCATGCTCGACTCT 
GAGACACTGATCGAGCATTAAGACTCTAGACTCTGTGCCATGATCGTGAGTT 
GTCGCAGTGTCTGTACCAATACTCTGGTGGAGCTATATAAGCCGCTGTTGCGT 
AAATCAACGGCATGATCCCTATGACCGCGTCATGCTAACTGATACACGCTGC 

1 0 TCGAACAGTGATACGCACACTGATAACTATGCGCAGACGCTTGAAACGATGT 
GACATCGCTTCTAGAGTATGAGCCGCAATGCACGACTGATACTCGATATGAG 
CAGCAGTCGGCTATGATTTGCAATGCTTGCAGTATGTATCCTGATCGTGCGTG 
CGATGTCTGATAATACGCTCGCATGATATGTATTGCGCTCAGATGCTGGAGAT 
ATGCCATGCGTGCTGTCAGTATGCCATGTATGCTGATATGTCGCGATCTATGT 

1 5 GGTGACTATGAGATCC ATGTGATGACGTTGC AGTCTCTGTGACCTTATCGACG 
CGCATGTGAGCCTATAGACAGCGATGTGAGCACTCTCATCTGCGGATCAGTC 
TATCCTCGCTGATGCTCAGTGATACACGCTGATGCACGTAGTGAGCATCCTGT 
GCTCGCATATACCGCTGCTGCACTGATATGAGCCAGTGCTGCTGCTCTCTACG 
GAGTGTGCTCGGCTATAACAGCGAGTGCTACGCCTAAACTGGCTGTCTAGCA 

20 CTGTAGCTGGTGCATGTACTCGACTGCCGCTGCATCTACTATAAGACTCTGAC 
ATTAGCGTATAGGCTGATACATTAGCTCGGATGCTATCAGCTTGCGCCTATTA 
TATGCCTGACGCGGGATCTATCAGAACGACTCGGTAGCTCATATACTGGATC 
ACGGTGCCACAACATGCTACACGAGGTCTCAGACTCTATCCCGTGGACTCAA 
CGTGCATCTGCTATGCTGAGCGCGTATCTGTGTACCTGTCCGATGCTCTGATC 

25 TACACTGCCGTGATCGTTATATGACGAGACTGTGCGCTCATAGCCGACACTGT 
GCTCGATAAGACCACGCTGTGCGGATATAGTCGACCTAGTGCATCCTCGTGG 
CATCATGCGTCTCCTCAGTAGGTCTGCGACTGATCCTAGTGCAATGCGTCTGA 
GCCTGAGCTACAGCGATATAGCCTGGATTGTGAGCGTATTTGCTGTGAGAAC 
CTCAGCTCATCATGTATGATGCTGTACCATCCTGCGATACTGAAGATGCACCG 

30 CTATAATGCGAGGCTCTCCGCTAAAGTGGAAGCTGCTCGTTCTCAATGCGAG 
CGAGTCGAATCCAATGCCGTAGCTGCGATAACGATGCCGCTGACTCTACGGT 
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AATGCACGATCCTCTACATTGATAGCAGATAGTCTAACGGGATAGCATAGGT 
GCAAGGCTCCTAGCATGTAGTCACAGGTGCTCAGATATAGTCATCGCTGCAA 
TCAGCTAGTCATCTTGTCAGGATGCTACTCACTGCGTGCAGAAGATTCGCACG 
ACTTCAGAGGATGGCACTCGTCATTAGAGTGATGTTCTCGGATCGACACTGCT 
5 GGTCTGCGAATGACTCGCATTCACTAACATGGAGCATCGTTATCTAAAGGGG 
ATGCACGTTATCGTCGAGTGGCCGTCATGTCTATGCAGTGCGGCCTATGTCTC 
ATTAGCGAGTCGTATGTATCATGTCGGGCTCGAATGTTGCACACGTCTGCGTA 
ATGGTGACCGCTAGTCCCACATGGTGCTTCGTAGCCACAAATGTCGTTAGGTA 
GACCGACGTTATCGCGCTATACCCGATGTCAACGCGAGTTAGACCGTATCGT 
1 0 CCCC AGTGCCCT AAGATGGTCAAGCGTGCTCCTACGTT AGTATC AGTTTCCCT 
ATTGGTACGTCTGGCGTACTTCTGAAACGTGATGGGCGGCTGGTTACCCGTAT 
ATGGGCTCGGTTGACCTCTATTGGGCGTTGTTGACCCeaattcc ggaaaaaaaaaaaaaaaa 
aaaaa ctgcaggcgtaccagctttcc ctatagtgagtcgtatta 

15 TagIQ.EX (3849 bp; the 2 bp differences frotn TaglQ are underlined and in bold) 

gcatgc aattaaccctcactaaaggg acgcgtacgtaagcttGATAAGCGTTCACAGCTCGGCAATAC 

ctgtgacgagctgctcgcaagatttacgcagtgtggctatacttgacagtgat 

ggcgcttacttcagatgtatgggtgatacttcgctatatgggtggtcacttct 

ctatggcgcgtgacaatgtaCtatggagcggtcaatgtcagtacggatcgcg 

20 tcgatctaggtgactacgcacgcctctggagtaaatcgagtgctccgtgcga 
aatacgcggtcatcgtgcgaataaccgagtcatcgtgagtagtatgaacgtg 
tcgtgttatgcagcggtatgtcgtgctataatggcgtctgtcgtgctcataag 
gttcctctgatgtgctagacgtgtccatcgagctgcatagctatacttcgagt 
cacttgggatacttcgatagcgttgtgaatagtgtcgtaggctctcgggcacg 

25 ttgttaaactgttgccgccaattcaagattagtccagctcgtactatcgaata 
caccatcgtcgtatcgaataatcgcacctcgtaggagtcagttgccactcgtt 
gatagtcaaccaagctcgttagatagtagcccagatcctacgagatgagcta 
cgtaactacagtgatagcatatagggtacgctagaatgccaggtcgtagtcg 
aattagtcaggttggatgtctactagttgacttggagtatgccatgaagactc 

30 gtccctcgatatcaatactcgtccgcaggtgaacactgtagtcggtgctagtg 
cccacttctcggtatgtgtcctcaattatcgagtaggattctaatcaatcgtc 
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GCGGCTCACTAATTGTCTGCGGTGGCTACTAATGGTTACGGTGCCTGACTAAT 
CGTGTAGGTGTCTAATACATCGTGATACGGGCGATATAATGCTCGATACGGC 
AAATATAGCTCCGTCCGGTGGATCCAGATCGCAGGGTATCGCATCGACAGAC 
CTGGTATCGTCGTGACGAACGTGCTACTCGCTTATCGGGCCTGCTACATCAGT 
5 GGCGATGTTCGTAACCCTTAGCCGATCTTCTTACTTACGAGGCTACTATTCGA 
TCAAACTCGCCTATCTGGTAATAACTGCGGTGATCTGGTAGCCACTACGTGCG 
CCTGGTAGCAAATACGGCGAGCTGGTATCACTATCGGCTCAGTGGTCCGACA 
TAGTGCCCAGTGGTTCGCATAACTGCCGCTGGGTCCAATATAACACGCAGTC 
GTCAATCATACGAGCCGATGGTCAGCAATAGCGCCTGTGGTGACACTATGCC 

1 0 ACCTCTGGTCT AATATAGCGCCCTGTGGTCGTAT AATCGAGCGCGTAATCGT A 
TATCCGACTGTAGGTGCGTAACTCGCGACTAGGTGGCTCTAATCTGCGTTGGT 
TGTCGCTCACAGTGTCTGGTGTTCGATACCCGGATCGGGTTCCGTAATCTTGG 
CATCGAGGTTTCGTACATGTCACGCGGTCTCGTTCATTCTCGGTGGTGCTCAG 
TACATCCAGTGGTGAGTCGCTACATCACACGGTGATCCGGCTAAACCTCTGG 

1 5 GC ATCCGTATTAAGCGACATTCCTACGACTTATGAGC ACGTCCTACGGTATAA 
CAAGGCGTGCTACGGTCTAACGACGCTGGTAGCAGTCTATCAGATCGeTAGT 
ACGAGTTAGAGATGCTTAGTACGCCTTCGAATCTATGATGCTCGTGCTCACGC 
GATGCACTCGGATTATGGCACATGCACTCGCGTAATGACGCTGCATCGCTCA 
GTATGATCCATGAGCGCCGTGAATGACGCATGAGCCTCGTATCGAGTGCATG 

20 AGCTGTCTTTCACATGATACATCGCTCTAAATCATCATGCGACAGTCTCGACA 
GCAGCTCAGCATCTATGCATCATGTGCCTCACTAGGACATCATGCTCGACTCT 
GAGACACTGATCGAGCATTAAGACTCTAGACTCTGTGCCATGATCGTGAGTT 
GTCGCAGTGTCTGTACCAATACTCTGGTGGAGCTATATAAGCCGCTGTTGCGT 
AAATCAACGGCATGATCCCTATGACCGCGTCATGCTAACTGATACACGCTGC 

25 TCGAACAGTGATACGCACACTGATAACTATGCGCAGACGCTTGAAACGATGT 
GACATCGCTTCTAGAGTATGAGCCGCAATGCACGACTGATACTCGATATGAG 
CAGCAGTCGGCTATGATTTGCAATGCTTGCAGTATGTATCCTGATCGTGCGTG 
CGATGTCTGATAATACGCTCGCATGATATGTATTGCGCTCAGATGCTGGAGAT 
ATGCCATGCGTGCTGTCAGTATGCCATGTATGCTGATATGTCGCGATCTATGT 

30 GGTGACTATGAGATCCATGTGATGACGTTGCAGTCTCTGTGACCTTATCGACG 
CGCATGTGAGCCTATAGACAGCGATGTGAGCACTCTCATCTGCGGATCAGTC 
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TATCCTCGCTGATGCTCAGTGATACACGCTGATGCACGTAGTGAGCATCCTGT 

gctcgcatataccgctgctgcactgatatgagccagtgctgctgctctctacg 
gagtgtgctcggctataacagcgagtgctacgcctaaactggctgtctagaa 
ctgtagctggtgcatgtactcgactgccgctgcatctactataagactgtgac 

5 attagcgtataggctgatacattaggtcggatgctatcagcttgcgcctatta 
tatgcctgacgcgggatctatcagaacgactcggtagctcatatactggatc 
acggtgccacaacatgctacacgaggtctcagactctatcccgtggactcaa 
cgtgcatctgctatgctgagcgcgtatctgtgtacctgtccgatgctctgatc 
tacactgccgtgatcgttatatgacgagactgtgcgctcatagccgacactgt 

1 0 gctcgataagaccacgctgtgcggatatagtcgacctagtgcatcctcgtgg 
catcatgcgtctcctcagtaggtctgcgactgatcctagtgcaatgcgtctga 
gcctgagctacagcgatatagcctggattgtgagcgtatttgctgtcagaac 
ctcagctcatcatgtatgatgctgtaccatcctgcgatactgaagatgcaccg 
ctataatgcgaggctctccgctaaagtggaagctgctcgttctcaatgcgag 

1 5 cgagtcgaattcaatgccgtagctgcgataacgatgccgctgactctacggt 
aatgcacgatcctctacattgatagcagatagtctaacgggatagcataggt 
gcaaggctcctagcatgtagtcacaggtgctcagatatagtcatcgctgcaa 
tcagctagtcatcttgtcaggatgctactcactgcgtgcagaagattcgcacg 
acttcagaggatggcactcgtcattagagtgatgttctcggatcgacactgct 

20 ggtctgcgaatgactcgcattcactaacatggagcatcgttatctaaagggg 
atgcacgttatcgtcgagtggccgtcatgtctatgcagtgcggcctatgtctc 
attagcgagtcgtatgtatcatgtcgggctcgaatgttgcacacgtctgcgta 
atggtgagcgctagtcccacatggtgcttcgtagccacaaatgtcgttaggta 
gaccgacgttatcgcgctatacccgatgtcaacgcgagttagaccgtatcgt 

25 ccccagtgccctaagatggtcaagcgtgctcctacgttagtatcagtttccct 
attggtacgtctggcgtacttctgaaacgtgatgggcggctggttacccgtat 

ATGGGCTCGGTTGACCTCTATTGGGCGTTGTTGACCCgaattccggaaaaaaaaaaaaaaaa 
aaaaac tgcaggcgtaccagctttcc ctatagtgagtcgtatta 

Example 2 

30 Testing the Tag genes 
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The synthetic genes were tested in a number of ways. 1) An oHgonucleotide array 
was designed and made to probe many positions along the length of each Tag gene. 
Hybridizing RNA made from the Tag genes clearly shows the expected uniform 
hybridization both across each gene and between the 13 genes, a uniformity that is 
5 lacking from naturally occurring genes. This uniformity is expected because the Tags are 
originally designed for such characteristic. 

In addition, the average signal from the Tag genes is higher than the signal frona 
transcripts from human genes spiked in at equivalent concentrations. Data from these 
experiments are used to help develop new probe selection rules and new gene expression 

10 algorithms. 2) Probe sets for the Tag genes are included on the Affymetrix HG_U133 
human gene expression arrays (Affymetrix, Inc., Santa Clara, CA). Tag gene RNA 
spikes are used to help validate the array design. Again the Tag gene transcripts 
demonstrate consistent hybridization and high signal intensity. 3) The plasmid 
containing the longest Tag gene construct, pTaglQ, contains 3849 bp of Tag sequence 

15 (Tags I, N, O, and most of Q). This plasmid may be used for genotyping applications. 

For variant detection (resequencing) assays, the plasmid may be used as a template to test 
long-range PCR (Figure 4) and the PGR product from this plasmid can be labeled and 
hybridized to test other steps of the assay. For microarray SNP analysis, TaglQ.EX 
(Figure 5) can serve as an assay control. One sample preparation method calls for 

20 digesting genomic DNA with a restriction endonuclease and then preferentially 

amplifying fragments of a particular size range, 400-800 bp, for example. TaglQ.EX can 
be added to the test DNA, and then digested with Xbal or EcoRI, amplified, labeled, and 
hybridized along with the test DNA. The results of the Tag sequence can be used to 
assess system performance. 4) RNA spikes from Tag genes have been used as exogenous 

25 controls in quantitative RT-PCR experiments. These spikes can be used to normalize 

quantitative RT-PCR to aid in determining absolute transcript levels. In addition, the Tag 
gene spikes can also allow direct coniparispns between microarray and RT-PCR results, 

(Si 

or between different types of microarrays (spotted arrays vs. GeneChip arrays 
(Affymetrix, Inc., Santa Clara, CA), for example). The universal absence of the synthetic 
30 genes will also allow comparisons between different sample types; for example, data 
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from microarray and RT-PCR experiments can be normalized for samples from mouse, 
human, and bacteria. 

An example of an application of the cloned Tag genes, is provided by the 
Affymetrix CustomSeq(TM) resequencing arrays, which contain probes complementary 
5 to portions of both DNA strands of the TaglQ.EX sequence, as well as probes 
complementary to DNA derived from customer-specified genes or genomes. A 
GeneChip(R) Resequencing Assay Kit contaiiiing the TaglQ.EX plasmid and PCR 
primers is available from Affymetrix to amplify the relevant Tag DNA, and thus serveis 
as a control for the PCR process. Amplified Tag DNA can then serve as a control for 

10 fragmentation and labeling. Furthermore, because the Tag sequence was chosen to be 
absent from any genomic sample, cross-hybridization should be minimal between Tag- 
derived DNA and DNA derived from any genomic sample, so Tag DNA can be mixed 
. with DNA complementary to other probes on the resequencing arrays. Hybridization of 
the mixture to resequencing arrays provides a control of the hybridization and base- 

15 calling process. 

It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will 
be suggested to persons skilled in the art and are to be included within the spirit and 
purview of this application and scope of the appended claims. All publications, patents, 

20 and patent applications cited herein are hereby incorporated by references for all 
pxirposes. 
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